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INTRODUCTION 

In  many  practical  situations  the  experimenter  is  faced  with  the 

following  problem:  given  a set  of  k populations  t r , . . . ,ir  with 

parameters  respectively,  decide  if  (1)  all  it  are  same, 

and  if  not  (2)  find  the  subset  it.  , (1  _<  t < k)  which,  in 

X1  xt 

some  sense,  is  better  than  the  rest  of  the  given  popualtions.  It  is 
clear  that  the  classical  tests  of  homogeneity  are  not  designed  for 
this  problem,  since  they  fail  to  provide  any  information  regarding 
the  second  question,  which  may  be  more  meaningful  than  the  first 
one.  A partial  answer  to  this  question  was  provided  by  Mosteller 
(1948)  who  proposed  a test  of  HQ:  = . . . = 0 against  the 

slippage  alternatives  H ..  : Oj  = ...  = 0i  - A = ...  = 0k>  i = l,...,k. 

Paulson  (1949)  proposed  a multiple  decision  rule  for  testing 
slippage  in  normal  means.  Since  then,  many  authors  have  contri- 
buted to  the  theory  of  slippage  tests.  Karlin  and  Truax  (1960) 
have  given  a decision  thoeretic  formulation  of  the  single  slippage 
problem  and  have  derived  the  class  of  symmetric  Bayes  proceedures. 
Hall  and  Kudo  (1968),  Hall,  Kudo  and  Yeh  (1968),  and  Kudo  and  Yeh 
(1970)  have  considered  the  slippage  problem  as  a problem  in  testing 
of  composite  statistical  hypotheses.  Van  Ryzin  (1970)  has  discussed 
the  slippage  problem  from  the  Empirical  Bayes  approach  and  has 
derived  asymptotically  optimal  procedures.  A discussion  of  some  of 
the  availabe  results  is  given  in  Doornbos (1966) . 


The  theory  of  slippage  tests,  however,  cannot  be  applied  to 
more  general  situations,  and  provides  only  partial  answer.  Bahadur 
(1950)  is  among  the  earliest  authors  to  consider  a different 
approach  and  investigate  the  so-called  selection  and  ranking 
procedures.  References  could  be  made  to  Bechhofer  (1954),  Gupta 
(1956,  1965)  Lehmann  (1961) . 

Generally  speaking,  two  formulations  for  selection  and  ranking 
problems  have  been  considered.  Suppose  population  it.  has  distribution 
function  F(x,0^),  i = l,...,k,  and  let  0^  denote  the  i-th  ordered 
0i;  the  correct  pairing  of  and  0 j. . ^ is  unknown.  To  fix  ideas, 
suppose  is  better  than  ir^  (j  i)  if  0^  > 0_.,  and  consider  the 
problem  of  selection  of  the  best  population,  i.e,  the  population 
associated  with  0 , . 

P0 

The  first  formualtion  is  the  indifference  zone  approach  of 
Bechhofer  (1954),  in  which  a procedure  is  defined  to  select  one 
population  subject  to  the  condition  that  the  probability  of  selecting 


the  best  one  is  at  least  P*  whenever  0. - 0 ,,  . , > 6*,  where  P* 
l lkJ  lk-lj  — 

(r  < P*  < 1)  and  6*  > 0 are  preassigned  constants. 

The  second  formulation  is  due  to  Gupta  (1956)  in  which  a 
random  number  of  populations  is  chosen  so  that  the  probability  that 
0^  is  included  in  the  selected  subset  is  at  least  P*,  where 
P*  6 (£,  1)  is  a preassigned  constant. 

Selection  and  ranking  problems  have  also  been  considered 
from  a Bayes  approach.  The  problem  of  selecting  a subset  containing 


the  'best'  has  been  considered  by  Deely  and  Gupta  (1968),  where  it 


I 

is  shown  that  under  certain  conditions  the  Bayes  rules  for  a linear 
loss  function  selects  exactly  one  population.  Guttman  and  Tiao  (1964) 
have  studied  some  best  population  problems  when  the  a priori  dis- 
tribution of  parameters  is  known.  Deely  (1965)  has  investigated 
emperical  Bayes  procedures  for  the  problem. 

Interest  has  recently  developed  in  Bayesian  selection  rules 
for  nonlinear  loss  functions.  Eaton  (1967)  has  proved  that, 
under  certain  conditions,  the  natural  selection  rule  is  (i)  Bayes 
for  any  symmetric  a priori  distribution  on  (0  , . . . , 6.)  (ii)  has 

I K 

uniformly  minimum  risk  among  symmetric  rules  and  (iii)  is 
minimax  and  admissible.  Goel  and  Rubin  (1975),  Chernoff  and  Yahav 
(1977),  Bickel  and  Yahav  (1977)  and  Hsu  (1977)  have  considered  Bayes 
rules  for  selecting  a subset  containing  the  best  population.  Typically, 
these  Bayes  rules  are  difficult  to  compute.  It  has  been  observed 
by  Chernoff  and  Yahav  (1977)  and  Hsu  (1977)  that  Gupta-type 
procedures  are  reasonable  approximations  of  the  Bayes  procedures. 

A slighly  different  situation  arises  when,  in  addition  to  the 
k populations  tk  , a control  population  ir ^ is  given  and  the  goal 
is  to  select  a subset  of  populations  which  are  'better'  than  control. 
Gupta  and  Sobel  (1958),  Huang  (1975)  and  others  have  considered  this 
problem. 

The  present  thesis  consists  of  investigations  of  multiple 
decision  problems  mentioned  earlier,  and  some  related  topics.  In 
Chapter  I a problem  in  multiple  slippage  testing  is  considered,  and 
the  class  of  symmetric  invariant  Bayes  rules  to  test  if  all 
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parameters  0^  are  the  same  against  the  alternative  hypotheses  that 
some  (unknown)  subset  of  (0  ,...,0^)  of  a given  size  t ( 1 <_  t < k) 
has  slipped  to  the  right  by  the  same  amount  A > 0 have  been 
derived  for  location  and  scale  type  densities  admitting  sufficient 
statistics.  The  t-slippage  problem  has  also  been  considered  in 
nonparametric  situations,  and  locally  best  rules  based  on  ranks 
have  been  derived. 

The  rest  of  the  thesis  pertains  to  Bayes  and  Gupta-type 
selection  procedures.  In  Chapter  II  Bayes  rules  for  the  problem 
of  selecting  a subset  containing  the  t best  (t  known,  1 < t < k) 
have  been  investigated.  For  a loss  function  which  is  linear  in 
0 ^ , it  is  shown  that,  under  certain  conditions,  the  Bayes  rule 
selects  exactly  t population  which  are  associated  with  the  t 
largest  observations.  This  generalizes  a similar  result  of  Peely 
and  Gupta  (1968)  for  the  case  of  t = 1.  When  the  loss  functions  are 
non-linear,  Bayes  rules  are  analytically  and  computationally 
intractable.  Some  special  cases,  namely,  normal  exponential, 

Poisson  and  gamma  distributions  have  been  considered  and  Bayes 
rules  derived  for  natural  conjugate  a priori  distributions  of 
the  parameters  0^. 

In  Chapter  III  some  selection  problems  for  normal  populations 
have  been  considered  and  Gupta-type  selection  rules  based  on 
sample  medians  of  odd  number  of  observations  have  been  investigated. 
Exact  and  asymptotic  results  are  obtained.  In  the  special  case  of 
equally  spaced  normal  means,  the  proposed  rule  is  numerically  compared 


to  the  selection  rule  of  Gupta  (1965),  which  is  based  on  sample 
means.  As  expected,  the  sample  means  procedure  appears  to  be  better 
than  the  procedure  based  on  medians.  It  is  shown  that  the  asymptotic 
relative  efficiency  (ARE)  of  the  rule  based  on  sample  median 
relative  to  the  rule  based  on  sample  means,  when  the  normal  means 
are  in  a slippage  configuration,  is  equal  to  2/tt.  However,  in 
case  the  normal  populations  are  contaminated,  the  rule  based  on  sample 
medians  shows  a significant  improvement  over  the  rule  based  on  means 
in  terms  of  the  ARE. 

Chapter  IV  consists  of  investigations  of  Bayes  and  Gupta-type 
rules  for  selecting  a subset  which  contains  all  populations  'close' 
to  a given  control.  A slightly  different  problem,  in  which  the  t 
(1  £ t < k)  populations  'closest'  to  control  are  to  be  selected, 
is  also  studied  using  Bayes  and  empirical  Bayes  approaches. 
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CHAPTER  I 

TESTING  MULTIPLE  SLIPPAGES 

1.1  Introduction 

The  problem  of  multiple  slippages  can  roughly  be  described 
as  follows:  We  are  given  k populations  . The  goal  is 

to  decide,  on  the  basis  of  a sample  from  each  population,  if  all 
are  same,  and  if  not,  which  of  the  t (1  <_  t <_  k-1)  tk  are  dif- 
ferent from  the  remaining  k-t,  where  t is  a known  integer.  A 
similar  problem  in  multiple  slippage  testing  involves  an  additional 
control  population,  and  the  goal  then  is  to  compare  the  k populations 
it  to  the  control  population  and  decide  if  all  the  populations 

have  parameters  equal  to  that  of  the  control,  and  if  not,  which  t of 
the  k populations  are  different  from  control.  The  latter  problem 

will  be  referred  to  as  the  controlled  case.  If  ir.  , ...,tt.  are 

11  xt 

different  from  (it.:  j ^ i , l = l,...,t)  we  say  that  the  subset 

J *> 

{ tt  ^ , ...,tk  } has  'slipped'.  In  this  chapter  we  consider  a special 
case,  in  which  the  amount  of  slippage  is  assumed  to  be  the  same 
for  all  the  populations  which  have  slipped. 

Karlin  and  Truax  (1960)  have  derived  a class  of  symmetric 
Bayes  procedures  for  the  single  slippage  problem,  which  corresponds 


: 
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to  t = 1.  Hall  and  Kudo  (1968),  Hall,  Kudo  and  Yeh  (1968),  and 
Kudo  and  Yeh  (1970)  have  considered  the  problem  of  single  slippage 
in  the  framework  of  testing  composite  statistical  hypotheses. 

In  this  chapter  the  results  of  Karlin  and  Truax  (1960)  have 
been  extended  to  the  case  of  multiple  slippage  (t-slippage)  when 
t is  a known  integer  between  1 and  k-1.  A discussion  of  some 
available  rules  for  the  t-slippage  problem  is  given  in  Doornbos 
(1966). 

In  Section  1.2  definitions  and  notations  used  in  this  chapter 
are  introduced,  and  a decision-theoretic  formulation  of  the  problem 
is  given.  In  Section  1.3  the  general  form  of  the  symmetric  Bayes 
procedures  for  the  t-slippage  problem  is  derived.  Some  specific 
examples  are  considered  in  Section  1.4. 

In  Section  1.5  (1.6)  the  class  of  symmetric  invariant  Bayes 
procedures  for  the  t-slippage  problem  has  been  derived  for  location 
(scale)  parameter  densities  having  a sufficient  statistic. 

Section  1.7  consists  of  investigation  of  several  related  problems 
in  ranking  and  selection. 

In  Section  1.8  the  problem  of  t-slippage,  using  the  notions  of 
slippage  introduced  by  Karlin  and  Truax  (1960),  has  been  discussed  in 
nonparametric  situations.  Some  locally  best  tests  based  on  ranks 
have  been  derived.  In  Section  1.9  selection  of  a rule  from  the  class 
of  procedures  obtained  in  this  chapter  is  discussed. 
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1.2  Decision-Theoretic  Formulation  of  the  Multiple  Slippage  Problem 

k 

Let  x = 6 ]R  , the  k-dimensional  Euclidean  space,  be 

an  observed  value  of  a random  vector  X = (XJ.....X.)  which  has 
probability  density  function  (pdf)  f(Xl,...,xk;  01 , . . . ,eR)  with 
respect  to  a a-finite  measure  p.  The  form  of  the  function  f is 
known  and  Q^s  are  unknown,  6 € © cjr  fQi  i = l,...,k  where  the  set 

©has  at  least  two  points. 

Let  be  the  symmetric  group  on  {l,...,k}  i.e.,  the  group  of 
all  permutations  \p:  {1 , . . . ,k>  (1 , . . . ,k} . For  z £ lRk  and  <p  € SR 

define  z ^ by  (z^)^  = z^,  where  (z^)^  represents  the  i-th  coordinate 

of  the  vector  z , . 

-'I' 

The  pdf  f(x;0)  and  the  measure  p are  assumed  to  be  symmetric 
in  the  following  sense: 

6^)  = f(x;§)  Vi/,  €Sk,  x € JRk,  0 (1.2.1) 

dp(x^)  = dp(x)  V !/,  6 Sk>  (1.2.2) 

We  assume  that  f(x;0)  also  has  the  property  M of  Eaton  (1967). 


Definition  1.2.1:  A family  of  density  functions  (f  (x;0):  a FA) 

a - - 

is  said  to  have  property  M if  for  each  a £ A,  and  each  i,j  (i  / j), 
1 <_  i,j  <^1  the  following  holds: 


x.  > x. , 0. 
i - J i 


where  to.  . € S. 

riJ  k 


> e -f0(x;?)  > f (;;e  ) 

ij 

is  such  that  for  1 < i,  < k, 


(1.2.3) 


_ 


It 


j if  l = i 
U)  = ^ i if  l = j 

8.  otherwise. 


Let  S=  {J  c{l,2 k}  : |j|  = t) 

where  |j|  denotes  the  size  of  the  subset  J.  For  any  J e S,  let 


®J  = — *6k)  £ 6 ; 8j  = w ^ if  j l J’  j = o)eIR,A>0}. 

Also  let  eQ  = {e  = (er...,ek)  e ek:  0j  - j - u>eIR}. 

The  problem  is  to  test 


Ho:  ! e ®0 


against 


(t)  alternative  hypotheses 


Hj:  0 e 6j,  J e 8. 

The  action  space  G = (a^a^  J e 8}  consists  of  (k)  + 1 elements. 
The  action  aQ  corresponds  to  acceptance  of  H and  a to  the  accep- 
tance  of  H . 

%J 

For  J e {0}  U 8,  let  Lj(0)  denote  the  loss  incurred  in  taking 
action  Bj  when  0 is  the  true  parameter  value.  Here  L:  0k  x q -+  ir 
is  a u-measurable  function,  and  is  assumed  to  satisfy  the  following: 

(1)  Lj(0)  = L _i  (0.)  V J €{0}  U 8 and  € S where,  for 

^ - O j i ■ • • ij^Ji  J = { ip  j j , • . . , j ^ } 


(2)  Lj (0)  < Lj (0)  V 0 e ejt  I,J  € {0}  U 8,  1 t J. 
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(3)  Lj(0)  = Lj(0)  V0  | 0^  , I, J 4 L*  Otherwise  L^. (0)=Lq(0)  V I. 

For  the-  problem  of  comparing  the  k populations  » • • • »1rk  t0  a given 

control  population  tt^,  the  above  needs  some  modifications.  Here  we 

k+1 

are  given  an  observation  (Xj, . . . .x^.y)  e 1R  from  a population  with 
density  function  f(Xj, . . . .x^y;  0j , . . . ,0^,0^  = f(x,y;  0,0  ) with 
respect  to  a a-finite  measure  p;  0^  e © c IR  , i = 0,l,...,k. 

The  density  function  f is  assumed  to  have  the  following  symmetry: 


f(x,y;0,0o) 


k + l k+1 

fQyy^VV  €V(-'y)  em  €6 


(1.2.4) 


It  is  also  assumed  that  the  family 
(y,0Q)  Affine)  has  property  M. 
The  problem  is  to  test 


{f(y,e0) t-;-) 


f(x;y;0,0Q)  : 


.k+l 


HQ:  (0,0O)  e 60  = {(01,...,0k,0o)  € e>  : 0j  = 0Q,  i = 1 k} 


against  ( ) alternatives 


‘V  ‘S-V  tej*  {(V----VV  £®k“:  °j  'eg**  If  j \ j-  J=1 “• 


QgClR,  A>0}  where  J e S. 

The  action  space  6 = (a  ,a  : J e §}  again  has  ( >4-1  elements.  The 

U J i 

action  aQ  decides  in  favor  of  HQ,  and  a^  in  favor  of  H j . 

k+i 

The  loss  function  L:  © * Q -+  ir  is  a p-measurable  function,  with 

Lj(0,0q)  representing  the  loss  of  taking  action  a^,  when  (0,0Q) 
is  the  true  value  of  the  parameter.  The  assumptions  on  the  loss 
functions  in  this  case  are 


11 


(1')  Lj(0»0o)  = L _1  (!^0O)  v J E S,  4*  e Sk  and  (0,0O)  € e 

Ip  J 


(2>)  LJ(e,e0)  < LI(e,e0)  v (e,e0)  € eJf  i.j  ( (o)  u i / j. 


(3')  Lj(0,6q)  a Lj(0,0q)  V (®i®q)  e <E^»  i L. Otherwise 

Li(e,y  - V®.  V v x- 


1.3  Bayes  Solutions 


We  first  derive  the  Bayes  procedures  for  the  uncontrolled  t- 


slippage  problem.  Since  the  problem  is  invariant  with  respect  to 


the  group  S^,,  we  can  restrict  attention  to  symmetric  procedures. 


Definition  1.3.1:  A symmetric  decision  function  is  a p-measurable 


vector  function  cp  = (cpQ,  cpj : I e S)  : lRk  -*■  [0 , 1 ]m  where  m = (^)  + 1, 


such  that 


(i)  2 cpT(x)  = 1 V x fc  IR 

I€(Q}U  S 


(ii)  tpj(x)  = cp  _ j (x  ) V * ( Sk,  x 6 IR  , I f (0)  U g. 


In  our  problem,  for  I 6 {0}  U S,  cpj(x)  represents  the  probability 


of  accepting  Hj,  given  x is  observed.  In  the  terminology  of  Fereuson 


(1967),  tf  is  a behavioral  decision  function. 


Definition  1.3,2:  If  ft  denotes  the  set  of  all  possible  behavioral 


decision  functions  for  the  t-slippage  problem,  the  risk  function 


p : A x © + IR  is  defined  by 


P(<P,§)  - \ I LjfejtpjCjOffx.e)  d^ (x) 


IR*  It(0}U  g 
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The  following  two  results  will  be  used  in  deriving  the  form  of 
the  Bayes  procedures;  these  are  slight  modifications  of  Lemma  3.1 
and  Theorem  3.1  of  Karlin  and  Truax  (I960). 

Lemma  1.3.1:  The  risk  function  of  a symmetric  decision  function 

is  a symmetric  function  of  0. 

Theorem  1.3.1:  Any  symmetric  Bayes  procedure  for  the  t-slippage 

problem  is  Bayes  against  a symmetric  a priori  distribution. 

Let  (2  = ©Q  U ( U Gj). 

Ie  b 

The  set  ft  can  be  represented  as  follows: 

U = {(I,o), A):  I e {0}  U b>  m e IR , A > 0} 

where  I = 0,  A = 0 iff(I,cj,A)  6 €^,  and  I e b,  A > 0 iff(I,oj,A)  €:  ©j . 
Let  G be  an  a priori  distribution  on  ft,  and 


Cn  = / dG 


= L 


Also  let  Gq(w)  denote  the  conditional  distribution  function  of  w 
given  A = 0 and  Gj(io,A)  denote  the  ioint  conditional  distribution 
given  the  set  I has  slipped. 

It  is  easily  seen  that  G is  symmetric,  i.e.,  G(0)  = G(0^) 

V 0 e ft,  tt  e S^,  iff  all  I e b are  equal  and  all  Gj(w,A), 

I e b are  identical.  Then,  since  we  are  interested  in  symmetric 
procedures,  using  Theorem  1.3.1  we  can  take 
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€j  = £ >1  e a,  where  £ e [0,1], 

and 

Gj(w,A)  = G(u>,A)  V I £ S,  u e F , 4 > 0, 

Let  fn(x;w)  and  f (x;w,A)  denote  the  density  under  H„  and  H,, 
respectively.  Then  comparing  Bayes  posterior  risks  we  obtain 
yx)  = 1 if  x t Rq,  where  RQ,  a symmetric  set  in  lRk,  is  given  by 

RQ  = {x  e IRk:  eoT[L0((o)-LJ(c1.)]f0(x;«)dG0(u) 

+ ?/[Lq(J,i»),  A)-Lj(J,co,  A)]fj(x;to,A)dG(ti),  A) 

<0  KJ  c S)  (1.3.1) 

and  for  I e & 

yx)  = 1 if  e0/[Lj(u)  - Lj (w)  ] fQ (x; u) dGQ (to) 

+ C/[Lj  (J,o),A)  - Lj(J,a),A)]fj(x;a),A)dG(u),A) 

+ C/[Lj  (I,w,A)  - Lj (I  ,o),  A)  ]f  j (x;w, A)dG (a), A) 

< 0 V J e (0)  U i,  J t I.  (1.3.2) 

For  J = 0,  the  second  term  in  the  expression  (1.3.2)  vanishes,  and 
(1.3.1)  implies  that  x £ RQ.  Using  assumption  (1)  on  Lj(0),  it  can 
be  seen  that 

Lj(I,o),A)  - L^J.ai.A)  = -[LjCJ.gj.A)  - Lj(J,u>,A)] 
and  hence  taking  J e & in  (1.3.2)  gives 


m 


14 

Cfj(x)  = 1 if  J[Lj(I  ,<i>,  A)-Lj  (I  ,<d,A)  ] [fj  (x;u>,  AJ-fjCxjto,  A)  ]dG(w,  A)  < 0 

V J i I.  (1.3.3) 


It  follows  from  assumption  (2)  on  the  loss  functions  and  the  property 
M of  the  density  f that  the  inequality  in  (1.3.3)  holds  iff  the  set  I 
is  such  that 


f Xi ' 1 6 I}  = {x[k-t+l]’"-,x[k]} 


where  xrTl  <...<  xr.  , are  the  ordered  observations  x.. 
[I]  - - fk]  l 

Hence  we  have,  for  I e S 


<Pj  (x)  = 1 if  x £ Rq  and  the  set  I is  as  defined  by  (1.3.4). 
Thus  we  have  the  following  theorem: 


Theorem  1.3.2:  If  f(x,0)  and  u satisfy  the  symmetry  conditions 

(1.2.1)  and  (1.2.2),  respectively,  f has  property  M,  and  loss 
functions  satisfy  assumptions  (1),  (2),  (3)  of  Section  1.2,  then 
any  symmetric  Bayes  procedure  cp  is  of  the  following  form: 

k 

cfg(x)  = 1 if  x fc  Rg,  a symmetric  set  in  IR 

cpj(x)  = 1 if  x € Rg  f)  (x  lRk:  {x^  i t I)  = fx[k.t  + 1^ » • • • >x[k] } ) 


Remarks : 

1.  The  form  of  the  set  Rfl  depends  heavily  on  the  density  function  f, 
as  illustrated  by  the  examples  discussed  in  the  next  section. 


2.  For  t = 1,  Karlin  and  Truax  (1960)  assume  that 


L. (0)  = L .(6  ). 
J - irj  -ir 


(1.3.5) 


P1" 


i 


* ■ ‘^1 
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It  is  clear  from  (1.4.3)  that  fj(u;A)  is  an  increasing  function  of 
hj(u)  and  therefore,  from  (1.4.2)  we  have 


Rq  = (u:  hj(u)  < c , M J c 8) 


= ( (x,y) : max  (Xj  - ^Z.)  < c) 
J G J 


(1.4.4) 


i 


where 


i * j£j 

XJ  t 


Hence  every  symmetric  invariant  Bayes  procedure  has  the  form 
<fy(x,y)  = 1 if  (x,y)  e Rq 

^(x.y)  = 1 if  (x,y)  i R0  and  (x.:  i € I)  = {x[k_t+1]f . . . ,x[k] } 
where  the  set  RQ  is  given  by  (1.4.4). 

(b)  The  normal  distribution  with  known  variance  (uncontrolled  case) 


The  additional  assumption  on  the  loss  function  is 


^1  (^i^c* . . • ,6^+c)  * Lj  (6j » • • • »6k) , V c e 1R,  I e (0)  US  and 


(0j  , • • • * 0j^)  £ 0 . 


The  problem  is  again  invariant  with  respect  to  the  group  of 

translations  and  a maximal  invariant  is  u = X.-X.  ,...,U  = 

1 IK  R ■■  1 

Xk.rXk.  The  joint  pdf  of  Uj,...,Uk_j  is 


f (Uj# . . . »uk_ w1,...,wkl)  = C exp[-  j Q (u ; u) ] 


where 


k-1  k-1 


Q(u;w)  = l l A1J  (u.-<u.)(u.-o).) 
i*l  j*l  1 i J J 


(1.4.5) 


u.  = e.-e. , x 

i i k’ 


k-1 

— ^ i = 3 


- f ^ i/j 


It  is  easily  verified  that  f(u;w)  has  property  M.  We  now  calculate 
fj(u;A),  the  joint  pdf  of  for  6 ©j,  J e s-  There  are 
two  possibilities: 


(i)  k£J 


0 if  j f J,  j / k 


-A  if  j i J 


9j(u;4)  - Q0(U)  * - talGj  - !£■  5) 


l 


k-1 

l u. 

_ „ - i=l  1 

V °’  U = ___ 


(1.4.6) 


In  this  case 


A if  j £ J 


0 if  j | J 


Substituting  the  values  of  uk  in  the  expression  (1.4.5)  we  find 
that  the  form  of  Q^(u;A)  remains  the  same  as  (1.4.6). 

Then,  as  in  case  (a),  the  set  is  given  by 


R0  = “j  ' ~T~  “ < c V J e g> 


3 (x:  x.  - x < c V J e »)•■ 


(1.4.7) 


Hence  the  symmetric  Bayes  rule  in  terms  of  the  observations 
are  given  by 
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cp0(x)  = 1 if  x fc  RQ 


(Pj(x)  = 1 if  x 4 RQ  and  (x^-  i € 1}  = {x 


[k-t+l]'"-’x[k]} 


where  RQ  is  given  by  (1.4.7). 

2.  (a)  The  normal  distribution  with  unknown  variance  (controlled 
case) 

yj  and  (i  = l,...,k;  j = l,...,n)  are  n independent  observations 

from  k+1  independent  normal  populations  with  means  6„  and  6 

0 i 

(i  ~ 1»  • ■ • >k)  and  a common  unknown  variance  a2.  The  loss  function 
is  assumed  to  satisfy 

e +<y  &+a  en+a 

*^1  l.  g »•••>  g » g » g'J  = Lj  (0  j , . . . »®kf0Q  »°) 

, V a fc® , 6 > 0 and  I € (0)  U {§}. 

The  problem  is  invariant  under  groups  of  location  and  scale 

transformations,  and  a maximal  invariant  for  both  groups  is 

XrY  Xk-? 

Uj  = — g — , ....  Uk  = -g — - , where 


x.  = 


? - 1 


i = l 3 


k n 


= l 1 (xirXi r ♦ [ (y 

i=l  j=l  13  1 j=l  3 


-Y)2. 
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The  joint  pdf  of  U.,...,!!.,  as  given  in  Karlin  and  Truax  (1960), 


k k 


f(u;n)  = c / exp  [-  -j  \ J A ^ (u.  -n- ) (u . s-n • ) - — y]s  ds 

0 z i=l  j=l  is  i j 3 ^ 


where 


kn 

k+T  lf  1 ’ 3 


" k+T  if  1 i 


It  is  easily  seen  that  the  joint  pdf  of  given  the  set 

J ( g has  slipped,  is  given  by 


f I (u; 6)  = C A(S)g(u )/  exp[t6hT(u)-  ^r]sn'S  ds 
J - ‘ o J - 2 

where  6 = |,  A(6)  = exp[-  62] 


g(u)  = k k n+k 

( l l x‘iu  » .i)2'"-1) 

i»l  j=l  1 J 


tn(V  k+T  “J 


and  h (u) 

kJ  " 


k k 


( I l Alju.u  +1)5 
i=l  j=l  1 J 


We  now  verify  that  the  pdf  f(u;n)  has  property  M. 

For  each  fixed  s > 0,  the  exponent  of  the  integrand  in  the  expression 

for  f(u;n)  can  be  written  as 

. 2 

QA(u;n)  ~ ' 2 t Csu-n) ’ A Csu-n) ] - 


where  A = (A  J)  = n 1^  - e'e. 


e,  . * (1.....1)  £ m\ 


21 


It  follows  from  proposition  (2.2)  of  Eaton  (1967)  that  f(u;n) 
has  property  M.  Hence  all  the  assumptions  are  met,  and  Theorem 
1.3.2  gives  the  form  of  the  symmetric  invariant  Bayes  rules,  except 
for  the  set  Rq.  The  set  Rq  is  given  by  the  inequalities 


C0(L0"LJ)f0(-)  + ?/tL0(J.(S)-Lj(J»A)]fJ(u;5)dG(6)  <0  Y J e S. 
Since  fj(u;<5)  is  monotonically  increasing  in  hj(u)  we  have 
tr»(Uj-  rTj  u) 

Rft  = (u: — < c ,V  J e §). 


0 


k , . 

{1*nU,ui  - 


1=1 


Changing  to  original  observations  x^,y,  we  have 


x.+y 

cft) (xi j *y)  = 1 if  * Rq  = {(xij>y):  --  *-k 


x.+y 


< c 


[ S +n  l (x  . - -j— r)  1 
i k+l'+ 

X 4-  v 

n(y-  -j^-)2],  y j c 


*Pl  (xij  *j0  = 1 if  (xij,y)  i R0  and 


I 


(x.:  i € I)  - {x[k_t+1].---»x[k]> 


where  Xj  = 


(b)  The  normal  distribution  with  unknown  variance  (uncontrolled 
case) 

Here  x^j  (i  = l,...,k;  j = l,..,n)  are  n independent  observations 


22 


from  k normal  populations  with  means  0.  and  a common  unknown  variance 


o".  Additional  assumption  on  the  loss  function  is 


0 ,+a  0,+Qf 

Lr(  g—>  • ■ ■ > 3-.  g)  = LI(e1,--.,ek,a) 


Va  t 1R,  6 > 0,  I t {0}  U S. 


Then,  as  in  case  (a),  we  have 


XJ  - x 


R„  = { x . . : max  [ , 

0 ij  „ 1 k n 9 ! 

< I l 


] < c} 


k 

l x. 


i=l  J=1 


where  x = 


i=l 


I x. 

i€J 


XJ  = t 


and  the  Bayes  rules  have  the  form 


W * 1 if  xij  e Ro 


• 1 if  X..  i R()  and  {x^  16  1}  = x[k)}. 


3.  (a)  The  gamma  distribution  (controlled  case) 

Here  Xj,...,x^,y  are  observations  from  k+1  independent  gamma 
populations  with  unknown  scale  parameters  0^ , . . . , 0^ ,0^ , respectively 
and  a common  known  shape  parameter  p > 0. 

Additional  assumption  on  the  loss  function  is 


01  0k  0O 

c ’ P = V c > 0,  I e (0}  u &■ 


Xi  Xk 

The  joint  pdf  of  maximal  invariants  = y— , = y—  is 


i 
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f (li^  ) • • • ) . • . , ) — 


c n to.P  U?  1 
i=i  1 1 


k u.  ,,  .. 

a*  I -i)<k*»p 


i-i  “i 


where  c is  a constant,  and  w. 


0. 

1 


i 


It  is  easily  verified  that  f(u;w)  satisfies  the  condition  (1.2.3) 
and  hence  f has  property  M. 


Then  the  set  RQ  is  given  by  the  inequalities 


CqCLq-Lj)^^)  + /[L0(J,A)  - LJ(J,A)]fJ(u;6)dG(6)  < 0,  V J e & 


where  6 = — , and 

60 


J 'i+«)pt 


t- 


tu. 


-]• 


(l+6r  (1  6 fc“j  J (kfl)p 

1+6  1+ku 


tu. 


Since  f (u;6)  is  an  increasing  function  of  , we  have 

J ' 1+ku 


tu, 


Rq  = {u:  max 


< c} 


J6S  1+ku 


= {(x,y):  max  < c) 

JeS  y+kx 


x. 


where  x,  = 


jeJ 


k 

I x, 


x = 


■■  - - — 


j 


rr — : r 


LIT  LTT^T:'*? 


(b)  The  gamma  distribution  (uncontrolled  case) 

Similar  analysis  shows  that  the  symmetric  Bayes  procedures 
for  the  uncontrolled  case  are  given  by 


cfy(x)  = 1 if  x (R0  = (x:  - < c J J e §) 


(x)  - 1 if  x ^ Rq  and  {x^.  i £ I)  = {x  + j j » • • • ^ ' 


4.  Multinomial  Population 


Let  x = (Xj,...^^)  be  an  observation  from  a multinomial 
distribution  with  parameters  N and  0 = (0j,...,6^): 


x . 

k 0.J  k k 

C]’’"’’Xk’  = N!  n [—  p] , J x.  = N,  J 0.  = 1 

j=l  j-  j=l  J j=l  J 


0 < e <_  1,  1 ^ j <^k. 


In  the  notation  of  Section  1.2  wt  have 


% * <!  = el 


• = £-)>  * {0Q}.  say 


and  for  J c S 


m+A  if  j fc  J 

®T  = (0:  0.  = } 

J ox  if  j $ J 


1-ko)  „ • 1 

— r * 0 1 w < r ■ • 


where  A 
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Hence  Pq(x)  = 


k n (X.,) 

-i-1  J • 


i \ i,N  , » N,.  1-kuK  ?.j  xi 
Pj(x;u>)  = k P0(x)u  (1  + -^j-)  jeJ  j 


It  is  easily  verified  that  the  multinomial  distribution  has  property 
M,  and  hence  the  set  RQ  is  given  by  the  inequalities 


^(Lq-LjJPqW  + 5/  [Lq (J »w) -LjCJ iw)]Pj (x;u»)dG(w)  <0  VJ  € a. 


It  follows  from  (1.4.8)  and  (1.4.9)  that 


Rn  = (x:  max  ( J x.)  < c) 
U " JeS  j€J  3 


and  the  class  of  symmetric  Bayes  procedures  is  given  by  Theorem  1.3.2. 


1.5  Multiple  Slippage  roblem  for  Location  in  the  Presence  of  a 
Sufficient  Statistics  (Controlled  Case) 


Here  we  have  observations  Xj,...,x^,y  from  k+1  independent 
populations  which  are  distributed  as  f (x-0j) , . . . ,f (x-0^) , f(y-0Q) 
respectively,  where  x^,y,0^,0Q  are  all  real.  Assume  that 

(i)  f(x)  is  strictly  positive 
(ii)  f(x-0)  has  monotone  likelihood  ratio  (MLR)  in  x and  0 
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15s 


(iii)  f (x)  is  bounded,  and  f(0)  = max  f(x) 

x0R 

(There  is  no  loss  of  generality  in  the  latter  part  of 

assumption  (iii),  since  if  f(x.)  = max  f(x),  we  can  reparametrize 

x 

the  family  of  densities  by  setting  6|  = 0^  + x^) 
k+1 

(iv)  f(x.-9)  = r(x,y)q(0-0) , xR+1  = y 

A A 

where  9 = 0(x,y)  is  the  maximum  likelihood  estimate  (MLE)  of 

0 under  H_:  0.  = 6.H  i. 

0 i 0 

Loss  function  is  assumed  to  satisfy  (l1),  (2'),  (3')  of  Section  1.2, 
and  also  the  following: 

Lj(e1+c,...,0k+c,  0Q+c)  = LI(01,...,0k,0o).  V c 6 TR  , I e {0}  U fc. 

The  joint  pdf  of  maximal  invariants  - Y,  i = l,...,k  is 

given  by 

k 

p(u;u)  = / n f (u.-w.+s)f (s)ds 
]R  i=l  11 

where  w.  = 0.  - 0.,  i = l,...,k. 
l l 0 

It  follows  from  assumption  (ii)  on  f that  p(u;a))  has  property  M, 
and  therefore  the  symmetric  invariant  Bayes  procedures  are  given  by 
Theorem  1.3.2,  with  defined  as  follows: 

k 

R0  = (u  ]R  : S0 ^0"kj)P0 (“)+£/ [Lq (J, A)-Lj («I, A) ]Pj (u; A)dG(A)  < 0 

V J e S (1.5.1) 

where 

P j (u; A)  = / n f(u  -A*s)  n f (u.+s)f (s)ds 
1R  jO  3 j*J  3 
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f (u. -A+s)  k 

* / n tffu  +'s) — 1 11  f(u.+s)f(s)ds. 

JR  jfcJ  riuj  S}  i=l  1 

A A 

From  assumption  (iii)  and  the  fact  that  0(x+c)  = 0(x)+c 

k 

,Vc  IR  , x € 1R  (see  Lemma  5.2.1  of  Karlin  and  Truax  (I960)), 
we  have 

k+1 

n f(u  +s)  = r(u)q(0(u)+s) 
i=l  1 " 

where  u = (Uj , . . . ,^,11^)  fcIRk+1,  uk+1  = 0 
and 

f (u.-A+s)  q (6  T (u) -A+s) 

n r — i 1 = J - 

-i.  c-  'l  J 


jfeJ  f(uj+s)  q (0j (u)+s) 


where  0.(u)  is  the  MLE  of  0 based  on  the  subset  (u.:  j £ J). 
j - J 

Substituting  these  values  in  expression  (1.5.1)  for  RQ , and 
changing  the  order  of  integration  , we  obtain 


Rq  - (u:  / f (0.(u)+s)r(u)q(0(u)+s)ds  < 0 , \r  J e g) 
1R 


where 


00 

nz)  = /f  C[L0 (J,A)-Lj(J,A)]  31*1*1.  (Lj-Lq)  }dG (A) 


From  Lemma  5.2.2  of  Karlin  and  Truax  (1960)  we  see  tha<-  q(0-0) 
has  MLR,  and  therefore  H'(z)  is  an  increasing  function  (see  Lemma  2, 
p.  74  of  Lehmann  (1959)). 


Now 
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/ ne  ,(u)*s)q(e(u)+s)ds  = / 'f(z)q(z-[01(u)-0(u)])dz. 

IR  R J ' 

Since  q (z- [0j  (u) -6 (u) ])  has  MLR,  and  fCz)  is  increasing,  the  above 

A A 

integral  is  negative  if  0J(u)-e(u)  < c for  all  J e s,  for  some 
c fc  IR  , 

Hence 

A A 

Rq  = (u:  max  [0  (u)-0(u)]  < c} 

JeS  J " 

= {(x,y):  max  [0  (x) -0 (x,y) ] < c}  (I.S.; 

JCS  J ’ 

by  translation  invariance  of  0(u). 

(b)  Uncontrolled  Case 

In  this  case  assumption  (iii)  of  case  (a)  is  to  be  replaced  by 

k 

n f(x.-o)  = r(x)q(0-e). 
i=l  1 

The  loss  function  is  assumed  to  satisfy  (1),  (2),  (3)  of  Section  1.2 
and  also  the  following: 

LI(0I+c,...,0k+c)  = LI(01,...,0k)  V c fe  IR  I c {0}  u S. 

A maximal  invariant  is  U = (Uj  , . . . ,Uk_j)  = (Xj-Xk, . . . ,Xk_rXk) 
which  is  distributed  as 


p(u;w)  = / n f (u.  -ui.+s)f  (s)ds  where 
IR  i=l  11 


“i  = ei‘0k’  i = 

Using  the  method  of  Example  2 (b)  of  Section  1.4,  we  can  show  that 
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f(u ,+z-A)  k 

Pj(u;A)  = J n [f , J — ] ji  f(u  +z)dz,  with  u,  = 0. 

© jfcj  1 uj  i = 1 i k 


q (e  j+z-a) 

= / — ^ 


IR  q(0j+z) 


r (u)q (e+z)dz 


and 


P0(u)  = / r(u)q(6+z)dz. 

© 


Then,  as  in  the  controlled  case,  we  have 


R0  = / 't'(6J)r(u)q(e+s)ds  < OJ  J U} 


IR 


with  fO)  as  defined  in  case  (a) 


= (u:  max  [0T(u)-0(u)]  < c} 
JfcS  J " 


= (x:  max  [0  (x)-e(x)]  < c} 


Jfc  S 


(1.5.3) 


We  now  prove  that  the  class  of  Bayes  rules  of  Theorem  1.3.2,  with 
Rq  given  by  (1.5.2),  is  minimal  essentially  complete  if  the  loss 
function  satisfies  two  additional  assumptions: 


(i)  L:  S * 6 -*•  © is  bounded 


(ii)  y(A)  = LQ(J,A)  - Lj(J,A)  is  monotonical ly  increasing  in  A 

(By  assumption  (1)  of  Section  1.2,  y(A)  is  independent  of  J.) 


-'-e--ma  * • ^ * * • ^et  ^ = ^O’^l  ’ ’ ’ ‘ »^m^-Given  a symmetric  invariant 


m 


rule  (f,  we  can  find  a symmetric  invariant  Bayes  rule  cp(m)  such 


1 


that 
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p(cp,A)  - p(cp(m),A)  ^ 0 V A fc  D(m) 


Proof : In  the  terminology  of  Blackwell  and  Girshick  (1954)  consider 

,(m) 


and 


the  game  (D  m , G,  p).  Since  the  set  of  states  of  nature  D 

D R. 

the  action  space  G are  finite,  the  set  W c 1RK  (p  = (t)+l)  defined  by 

W = (w(a)  = (wQ (a) , . . . >wm(a) ) : w . (a)  = L(a,A.) 

i = 0,1,. . . ,m, a £ G) 

is  closed.  (Here  L(a,A^)  represents  the  loss  incurred  in  taking 
action  a fc  G when  A^  £ is  the  state  of  nature.) 

Moreover,  since  L is  bounded,  the  set  W is  also  bounded. 

It  follows  from  Theorem  2,  Section  2.10  of  Ferguson  (1967)  that 
Bayes  rules  for  the  game  , G,  p)  form  a complete  class,  and 

then  there  exists  a Bayes  rule  such  that 

p(cp,  A)  - p (cp^m^ , A)  > 0 J A t D^. 


Theorem  1.5.1:  Under  assumptions  of  this  section,  the  class  of 

procedures  of  form 


tpo  (?) 
cpj(x) 


1 if  max  (0.-0)  < c 
JfcS  J 

A A 

1 if  max  (0.-0)  > c and 


lXi : 1 * I}  {x[k-t+l]’*  " ,X[k]} 


is  a minimal  essentially  complete  class  of  symmetric  invariant 
procedures . 


' 
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Proof:  We  follow  the  proof  of  Theorem  5.2.2  of  Karlin  and  Truax 

(1960).  Since  y(D)  is  a monotone  function  of  A,  there  exists  a 

countable  dense  set  D = { AQ , A^ , . . . , Am> . . . } with  A^  = 0 such  that 

all  points  of  discontinuity  of  y(A)  are  contained  in  D.  By  Lemma 

1.5.1,  given  a symmetric  invariant  rule  cp  we  can  find  Bayes  rule 

tp^ , which  is  also  symmetric,  such  that  cp^  dominates  <p  at  A^, 

A , . . . , A . That  is 
l m 

p(cp^,A^)  - p(cp,A^)  <_  0 for  i = 0,1, ...,m  (1.5.5) 

where,  for  any  rule  cp* 

p(cp',A)  = L0(0)/cp^(x)p0(x)dx  + Lj  (0)  J cp}  (x)pQ  (x)dx 

1 

= -Y0J%OOp0QOdx  + LJ  c°)  (1.5.6) 

where  yQ  = Lj(0)  - LQ(0)  > 0,  and  for  A > 0 

p(cp)A)  = L0(A)/cp^(x)pQ(x)dx  + l LI(A)/cp|(x)-pI(x;A)dx 

I 

w 

+ LJ(A)/cpj(x)pJ(x;A)dx 

= Lq  (A)/[  1-cpj  (x)  ]pj(x;A)dx  + Lj(A)  •/cpj(x)pJ(x;A)dx 
by  assumption  (3)  of  Section  1.2 

= “Y (A)/cpj(x)Pj (x; A)dx  + LQ (A) . (1.5.7) 


By  symmetry  considerations  the  right  hand  sides  of  (1.5.6)  and  (1.5.7) 
are  independent  of  J. 
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Since  cjr  ■*  is  a symmetric  invariant  Bayes  rule,  there  exists 


a constant  c such  that 
m 


cp^m^  (x)  = 1 if  x t (max  [0  T-0]  < c } = R^ , say 
u - - j^g  J mu 

cp^-’fx)  = 1 if  x i R^m)  and  {xi:  i t II  = {x[k-t+l] ’ ' ' ' ,X[k] } ' 

~ k 

Let  R*  = {max  [ 0 - 0 J < c*}  c 1R  , where  c*  is  a constant  satisfying 

0 jes 


Jcf-0(x)p0(x)dx  = I P0(x)d x 


(1.5.8) 


Define  a symmetric  invariant  Bayes  rule  cp*  by 


<Pq(x)  = 1 if  x R* 


tpj(x)  = 1 if  x i R*  and  {xi:  i fe  I)  = ^x[k.t+1] » • • • >x[k] } ■ 


(1.5.9) 


We  show  that  the  rule  cp*  dominates  cp  for  all  A > 0.  It  suffices  to 


show  that 


p (cp*,A)  <_  p (cp.  A)  . D 


since  any  A1  4 D is  a point  of  continuity  of  y(A)  from  which  it 
follows  that  p(cp*,A')  <_  p(cp.A').  Let  A = Am.  Then  it  follows  from 
(1.5.5),  (1.5.6)  and  (1.5.7)  that 


/<Pq  (x)P0(x)dx  = / pQ(x)dx  >_  / <^(x)p  (x)dx 

R (m) 

R0 


(1.5.10) 


and  /cpjm^  (x)pJ(x;Ai)dx  ^ /cpJ(x)pJ(x;Ai)dx,  i = l,...,m.  (1.5.11) 


It  is  clear  from  (1.5.8)  and  (1.5.10)  that 
d R* 

K0  3 Ko 


and  therefore 


/cpj(x)Pj(x;Ai)dx  > /cP]m)(x)pJ(x;Ai)dx 


i /cpj(x)pJ(x;Ai)dx  ,v  i = 1, . . . ,r 


Since  the  above  holds  for  every  m,  the  result  follows. 

The  proof  of  minimality  of  the  essentially  complete  class  is  the  same 
as  in  Theorem  5.2.2  of  Karlin  and  Truax  (1966). 


Remarks : 


1.  If  the  direction  of  slippage  is  unknown,  we  have  a symmetric 

two-sided  t-slippage  problem.  The  symmetric  invariant  procedures 
for  this  problem  are  of  the  following  form: 


cfQ(x)  = 1 if  x £ Rq  = (x:  max  ( | 6 ^ - 0 1 ) < c) 

9 

cpj  (x)  = 1 if  x t Rq  and  | 0 ^ - 0 | = max|0j-0|. 

JfcS 

2.  It  is  clear  from  the  proof  of  Theorem  1.5.1  that  each  symmetric 
invariant  Bayes  procedure  for  testing  t-slippage  in  the  location 
parameter  case  is  uniformly  most  powerful  in  the  class  of  all 
symmetric  invariant  procedures,  having  a preassigned  error 
associated  with  HQ. 

3.  Similar  results  hold  for  the  uncontrolled  case. 
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1 1 lustrations 


1. 


Normal  populations  with  a common  known  variance. 

2 

x 

• 1 2 

It  is  easy  to  see  that  f(x)  = e , - °°  < x < 00  satisfies 

/2tT 

all  the  assumptions  of  this  section,  with 


e = 


kx+y 

k+1 


kx 


for  the  controlled  case 


for  the  uncontrolled  case 


l 


and 


; _ JfcJ  J _ - 

0j  t ■ xj- 


It  is  easily  seen  that  the  Bayes  procedures  obtained  by  substituting 
the  expressions  for  0 and  0j  in  (1.5.2)  are  same  as  the  ones  given 
in  Example  1 of  Section  1.4. 

2.  Location  parameter  of  the  exponential  density 


Here 


f(x-6)  = 


-(x-0) 

e if  x > 0 


otherwise 


For  the  problem  involving  a control  we  have 
6 = minixj , . . . ,xk,y} 


and 


0.  = min{x.} 

J jfcl  J 

and  therefore  the  set  is  given  by 
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Rn  = {(x,y):  maxlmin  x.  - min(x. , . . . ,x,  ,y) ] < c}.  (1.5.4) 

JfcSjU  J J k 

The  set  R^  for  the  uncontrolled  case  is  obtained  by  replacing 
(x,y)  = (Xj,.  ..,xk,y)  by  x * (Xj,...^.)  in  (1.5.4). 

1.6  Slippage  Problem  for  Scale  Parameter  Distributions 


We  discuss  the  controlled  case  first.  Here  x. , i = l,...,k  are 
independent  observations  from  populations  with  density  functions 

0 fOg— )«  i = l,...,k,  respectively,  and  y is  an  observation  from 
i i 

• • lx 

control  population  which  has  density  — f( — );  x > 0,  0.  > 0, 

60  90  1 

1 = 0,1,  — ,k. 

As  in  location  case,  we  assume  that 

(i)  f(x)  is  strictly  positive  for  all  x > 0. 

(ii)  f(^)  has  MLR  in  x and  0 

(iii)  MLF.  of  0Q  exists  under  HQ:  =...=  ©k  = 0Q>  and  is  sufficient 

for  0. 

Loss  functions  are  assumed  to  satisfy  (1'),  (2'),  (3')  of  Section  1.2 
and  the  following: 

LI(c01,..,cek,c6o)  = LI(01,...,0k,0o)  V c > 0,  IfcfOlUS. 

Analysis  similar  to  that  for  the  location  parameter  t-slippage 
problem  shows  that  every  symmetric  invariant  Bayes  rule  has  the  form 


, K+i  ejO»y) 

<p0(x1,...,xk,y)  = 1 if  (x,y)  U = ((x,y)  t:  IR  ; maxtr^-1 ] < c) 

J6S  % 

(x,y) 


L 


(1.6.1) 
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cpjOt.y)  = 1 if  x t RQ  and  (x.:  i 6 1}  = {x[k.t+l] ’ • • • »x[k] } 1 
An  analogous  result  holds  for  the  uncontrolled  case. 

Remark : 

The  class  of  symmetric  invariant  Bayes  rules  given  by  (1.6.1) 
is  minimal  essentially  complete. 

1.7  Applications 

Selection  of  t-best  populations  under  slippage  configuration. 
Let  be  an  observation  from  population  with  pdf  ffx.e^) 
(i  = l,...,k),  6^  t © c 1R  ; we  assume  that  f(x,9)  has  MLR  in  x and 
0.  It  is  also  assumed  that  the  parameters  0^  (i  = l,...,k)  are  in 
the  following  configuration: 


0 - 0jjj  0[k_t+2]  < 0[k-t+l]  = ,=  9[k]  = 9+A’  9 A > 0 

(1.7.1) 

where  0ril  <...<  0r,  n denote  ordered  0.,  and  t(l  < t < k)  is  a known 
[1]  - - [k]  i'  - 

integer.  The  problem  is  to  identify  the  subset  of  populations  which 


are  associated  with  (0^  t+1]  ’ ‘ ‘ ‘ ’ 9 [k]  ^ ' 


The 


action  space  G consists  of  ( ) elements: 


G = { a . : J £ S ) 


where  S is  as  defined  in  Section  1.2.  Assume  that  the  loss  function 
satisfies 


(i)  l (0f)  = Lj(e).  v its,  » esk 


I] 
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(ii)  Lj (0)  < Lj (0)  if  0 

(iii)  Lj(0)  = LjC0)  if  0i 

and  8 . 
J 

Then  we  have  the  following 


= 0+A.  K j t J,  I € S (I  i J) 
= 6 for  at  least  one  i fc  I 
= 0 for  at  least  one  j J. 
theorem  : 


Theorem  1.7.1:  If  the  above  assumptions  are  satisfied,  the  symmetric 

Bayes  rules  for  the  problem  of  selecting  the  t best  populations,  when 
the  parameters  are  in  the  slippage  configuration  given  by  (1.7.4), 
is  to  select  the  set  associated  with  (x^  t+1]  ’ ‘ ‘ ’ ,X[k]  ^ ’ 

2.  Selection  of  the  normal  populations  associated  with  largest 
probability  P(a  < x £ b) . 


Here  we  have  k normal  populations  N(p^,l)  i = l,...,k,  and  the 
problem  is  to  select  the  subset  of  populations  which  are  associated 
with  the  largest  t values  of  the  coverage  of  the  set  A = (a,b]  c 1R  , 
- »<^a<b£°°,  where  t (1  <_  t < k)  is  a known  integer  and  the 
coverage  of  a set  A for  a population  with  distribution  function 
F(x,u)  is  defined  as  [c.f. , Guttman  (1961)] 


I 


i 


Y(A,p)  = / dF(x,p) . 

A 

The  problem  of  selecting  a subset  containing  the  population  associated 
with  the  largest  coverage  for  A = (-°°,a]  has  been  considered  by 
Guttman  (1961).  Guttman  has  investigated  Gupta  type  procedures  for 
normal  and  exponential  populations.  Guttman  and  Tiao  (1964)  have 
considered  the  case  A = (-a,b]  and  have  derived  Bayes  rules  for  two- 
parameter  normal  and  exponential  populations  for  the  problem  of 


i . _____ 


, , rii  mm I'mirfii 


selecting  the  best;  the  a priori  distributions  for  the  two  parameters 
are  taken  to  be  independent  and  locally  uniform.  We  consider  the 
problem  of  selecting  the  t best  populations. 
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We  have 

b 

YCp)  = 4* ( (a,b]  ,p)  = / d4> (x-p)  = * (b-p)-<l>(a-p) 

a 

where  $ denotes  the  distribution  function  of  a standard  normal  random 
variable.  It  is  easily  seen  that 


(i) 

p) 

- r(2£- 

p) 

i.e.,  V 

is 

symmetric 

about  pQ  = 

(ii) 

Up)  - 

0 as  |p|  ■*  00 

(iii) 

f > 0 

dp  - 

if 

^ i ^0 

1 0 

if 

v t p0‘ 

Hence  Y(p)  is  monotonical ly  increasing  in  (-“MJq)  and  monotonical ly 
decreasing  in  (pg,“>).  It  follows  from  (i)  - (iii)  that 

lbi-u0|  < |p2-p0|  * nuj)  > nu2).  d.7.2) 

Let  0^  = It  follows  from  (1.7.2)  that  the  problem  of 

selecting  the  populations  associated  with  t largest  Y(p^)  is  equiva- 
lent to  the  problem  of  selecting  the  t populations  which  correspond 
to  ^ [ i j ’ * * * ’ ^ [t]  ^ * 

Let  Y.  = |x.-p„|;  it  is  easily  shown  that  if  X.  is  normal  with  mean 

l l 0 ' l 

p^  and  variance  1,  the  density  of  is  given  by 
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= -Ke^y)  + ♦(ei-y),  ei  > o,  y > o 

e‘x2/2 

where  <#>  (x)  . 

STH 

We  now  show  that  the  function  f(y,0)  = <f>(6+y)  + <K0-y)»  y > 0,  6 > 0 
has  MLR  in  y and  0. 

We  have 


— = (0-y)<|>(0-y)-(0+y)cp(0+y) 

and 

2 

lof^r  = cp(0+y)  (e+y)2  + cpC0-y)  C0-y) 2 
+ cp(0 -y ) - cp(e+y) 

>_  0 since  cp(0-y)  >_  cp(0+y),  V 0 > 0,  y > 0. 

It  follows  from  Ex.  6 on  p.  Ill  of  Lehmann  (1959)  that  f(y,0) 
has  MLR  in  y and  0. 

Then  assuming  that  the  loss  function  L (0)  satisfies  the  conditions 

J " 

given  in  Ex.  1 discussed  above,  it  can  be  easily  seen  that  a symmetric 
Bayes  rule  for  the  problem  is  to  select  the  subset  of  populations 
associated  with  {y^ j > • • • ’^[t]  ^ * 

3.  Selection  in  terms  of  entropy  for  binomial  distributions 


Let  , i = l,...,k  be  k independent  binomial  random  variables 
with  distributions  given  by 


P(X=x,p.)  = (")pX(l-p.) 


n-x 


x = 0,1 n,  l — 1 , . . . ,k . 


Let  0.  = H(p.)  = -p.  log  p.-(l-p.)  log  ( l-p±) . The  function 
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H(pt)  is  the  entropy  function  associated  with  the  binomial  population 
[c.f. , Gupta  and  Huang  (1976)]. 

Let  6 r . i <...<  0,.  1 1 <...<  0ri1  denote  the  ordered  0..  The 

[1]  - - [k-t+1]  - - [k]  l 

populations  associated  with  0^  » • • • »e are  interest.  The 

problem  of  selecting  a subset  containing  the  population  corresponding 

to  0.  , has  been  considered  by  Gupta  and  Huang  (1976).  Here  we  find 
J 

a Bayes  rule  for  selecting  the  t populations  associated  with 
0 [k-t+1 ] ' ’ “ ,6[k] ' 

Let  Uj  = |pi  - j|,  i = l,...,k.  It  is  shown  in  Gupta  and  Huang 
(1976)  that  the  function  H(p)  strictly  decreases  with  p = |p  - j|, 
and  hence  the  problem  of  selecting  (0^  t+1] ’ ’ " ' ,6[k] ^ e9u^va^ent 
to  the  selection  of  (iij  j j » . . . j ). 

Assume  that  p = (p.,...,p.)  fc  U ft,  = ft  where 

< Jts  i 

♦ at  if  j U,  id  f:  IR  , A / 0 / 


where 


e-  pj= 


'i  ♦ w+A  if  j £ J |(u|<|u)+A| 


£ 

The  action  space  G = (a  : J 8)  consists  of  ( ) elements. 

J X 

The  loss  function  Lj(p)  is  assumed  to  satisfy  (i)and  (ii)  of  Ex.  1 
of  this  section  and  also  the  following: 


(iii)  Lj(p)  < Lj  (p)f  ip  fcflj,  I,J  tS,  I / J 

(iv)  L j (p)  = Lj  (1-p)  ,ip  J U 

where  1 = (1,..,1)  is  a row  vector  in  IR . 

Since  the  problem  is  invariant  with  respect  to  the  group 
G - (e,g),  where  e is  the  identity  element  and  gx^  = n-x^,  we  can 
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restrict  attention  to  procedures  based  on  maximal  invariants 
Y.  = |x.  - §|,  i = l,...,k. 

Let  Q(y,0^)  denote  the  probability  distribution  of  the 
random  variable  Y.  = |xi  - , given  6i  = |pi  - . Then  Q(y,6i) 

has  MLR  in  y and  6^  [c.f.,  Sobel  and  Starr  (1975)]  and  hence  a 
Bayes  rule  against  an  exchangeable  prior  is  to  select  the  populations 
associated  with  Y[i] » • • • * t*ie  * smallest  values  of  the  observed 
values  of  Y. . 

l 

4.  Selecting  the  biased  coin 


We  are  given  k 2 coins,  one  of  which  may  be  biased.  Our  goal 
is  to  decide,  on  the  basis  of  n tosses  of  each  coin,  whether  all  the 
coins  are  fair,  and,  if  not,  which  one  is  biased. 

Let  p^  = probability  of  getting  Head  for  i-th  coin  (i  = l,...,k) 
Then  0^  = |p^  - y|  is  called  the  bias  of  the  i-th  coin  [c.f.,  Sobel 
and  Starr  (1975)].  The  problem  is  to  test 


V e,  -•••  \ ■ 0 


against  k alternatives 


H.:  0. 
J i 


i = J 


0 if  i t j 


. ,k,  0 < A < y. 


Let  x^  be  the  number  of  heads  in  n tosses  of  coin  i , Invariance 
reduces  the  problem  to  consideration  of  tests  based  on  maximal 


The  random  variables  Y^  i = are  independent,  and  Yi  is 

distributed  as 


Q(y.ei) 


n 2+y  2 -y  i -y  1 +y 

(n  )[0  i d-ep  *0.  d-o.)2  ] 

2 +y 


Let  Q0(y>9)  and  (y,0)  denote  the  distribution  of  Y = (Yj,...,Y.) 
under  HQ  and  respectively.  Then  simple  calculation  yields 


(n  " 3 

k I +yi 

and  (y;A)  = Qq Cy)h(y^ ;A) 
where 


h(y;A)  = 2n_  1 [A  (1-A)  ] 2 [(j^  ♦ (±^)yJ. 


(1.7.3) 


For  each  fixed  A,  we  have 


fr=  2n-1tA(l-A)]n-1[{(^)y  - (y^Hog 


> 0 J y since  0 < A < j, 


Hence  the  function  h(y;A)  increases  with  y.  Assume  that  the  loss 
function  satisfies  assumptions  (!'),  (2'),  (3')  of  Section  1.2, 
with  0q  = 0.  and  t = 1.  Then  all  the  assumptions  of  Theorem  1.3.2 
are  met,  and  a Bayes  rule  against  an  exchangeable  a priori  distribu- 
tion has  the  form 
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<fy(y)  = 1 if  y Rq,  a symmetric  set 

cp.(y)  = 1 if  jr  R and  y = max  y.. 

3 i<i<k  1 

Here  the  set  Rq  is  given  by  the  following  inequalities: 

* c/  [L^Cj  , A) -Lj  (j  ,4)  ]Qj  (y;  A)d&A  <0  V j = l,...,k. 
Substituting  for  Q^(y;A)  we  get 

U[L0(j,A)-Lj(j,A)Jh(yj;A)-C0(Lj-L0)}Qo(y).dG(A)  J j = l,...,k 
where  h(y;A)  is  given  by  (1.7.3). 

Since  the  integrand  is  an  increasing  function  of  y.,  we  have 

R0  = (y:  max  y.  < c} 

I<i<k  1 

1.8  Nonparametric  Slippage  Tests 

l*et  (j  = l*-..»n)  be  n independent  observations  from 
population  ir.  with  continuous  cdf  F (i  * l,...,k).  Let  g:  [0 , l]-»-[0 , 1 ] 
be  a continuous  distribution  function,  g(x)  t x.  The  subset 
{ n j : j t J),  where  J fe  S,  is  said  to  have  slipped  if 


FJ  = 


g(F)  if  j U 

F if  j i J 


The  problem  is  to  test 


V Fi  a F*  * 3 k 


\ 


against  ( ) alternatives 


Hj:  { iTj : j fcJ)  has  slipped. 

Karlin  and  Truax  (1960)  have  considered  the  case  t = 1 for  the 
following  two  types  of  slippage: 


(i)  g(F)  = ( 1 — A) F + AF  , 0 < A < 1 


(ii)  g (F)  = F1  A,  A > 0. 

We  will  investigate  the  general  case  (1  < t < k) 

(i)  g(F)  = (l-A)F  + AF2. 

Let  r. . be  the  rank  of  X. . in  the  combined  sample,  and  R = (r. .), 
lj  ij  r ij 

i = l,...,k,  j = l,...,n.  Since  the  problem  is  invariant  under 
monotone  transformations,  we  can  restrict  attention  to  procedures 
based  on  R. 

Let  P^(R)  denote  the  distribution  of  R = (r..)  under  H,. 

J ij  J 

It  follows  from  a result  of  Lehmann  (1953)  that 


pjX)(R) 


Lnk-ntJ 


E [ n { (1-A)+2A  U ^ 

l<H<n 

j€J 


(1.8.1) 


where  Uv  , 1 < m < nk  is  the  m-th  order  statistic  from  a sample 
of  size  nk  from  a uniform  distribution  on  [0,1].  Differentiating 
(1.8.1)  with  respect  to  A we  get 
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p(X) 

dA  FJ 


1 (riP 

E [ n { (1- A)+2A  U 3 } 


, nk  . , „ 

( . .)  l<£<n 

nk'nt  j€j 


n 

•a  i 

jfcJ  £=1 


(rU) 

2 U J -1 

Cri  £) 

(1-A)+2AU  3 


}] 


nt 


where  r = £ r.f. 

J-  £=1  3 


Following  Karlin  and  Truax  (1960),  we  find  regions  C_,  {CT:  J fc  S) 

U J 

in  the  set  of  all  possible  ranks  such  that 


P0(CQ)  = 1-a,  0 < a < 1 

pj^(Cj)  is  maximum  overS  for  small  A 
and  the  procedure  is  symmetric.  It  follows  that  Cj  has  the  form 

max  { l T.  ) = l r > y (1.8.2) 

JfcS  jfeJ  3 ’ jtl  1 

where  Yj  is  given  by 

P (max  l r.  < y,)  = 1-a  (1.8.3) 

u jfes  jy  1 

(ii)  g(F)  = F1+\  A > 0. 


As  before,  we  will  consider  procedures  based  on  ranks  only. 
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Let  Wj  <...<  w^  be  the  sequence  obtained  by  ordering  the  combined 
sample  (x.^:  i = j = l,..,n).  For  each  J ^ g,  we  define 

a new  sequence  , . . . ,Z^  as  follows: 


.(J)  _ 


1 if  w.  comes  from  { tt . : j t 
1 


0 otherwise 


Let  P (z)  denote  the  probability  of  a rank  order  z under  H, 
J J 

Then  from  a result  of  Savage  (1956)  we  have 


(nk-nt) 1 (nt) 1 (l*A)nt 

" [ l {zJJ)(l*X)+l-2JJ)}] 

i=l  4=1  1 


(1.8.4) 


Differentiating  (1.8.4)  at  A = 0 we  get 


d p(x^f-'|l  - (nk-nt) ! (nt)  1 r "k  v ,, 

dlPj  (!,L  cHvtF  [nt ' J,  J1(-r» 


Then,  as  in  case  (i),  the  symmetric  rule  tp  which  maximizes  P^(C.) 

J 

for  A in  the  neighborhood  of  zero  among  the  class  of  rules  satisfying 


W - 


is  given  by 


nk  i z 


c^U)  = 1 if  min  l l (^ — ) > y 
Jes  i=l  4=1  1 1 


(1.8.5) 


nk  i z(J)  nk  i z[!) 

V V , 4 . r r , 4 


cpj(z)  = 1 if  min  l l {-r— ) = l l (-4—) 

1 Ts-#*  1 1 n _ 1 A •_■»«!  A 


JfcS  i=l  4=1 


i=l  4=1 


(1.8.6) 
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1.9  Computation  of  the  Constant  c 

The  Bayes  rule  derived  in  this  chapter  are  of  the  form 
<pn(x)  = 1 if  x t {x:  max  U (x)  < c}  = R_,  say, 

■ jes  J " 0 

^(x)  = 1 if  x t R0,  Uj  (x)  > UjCx),  ¥ J t I 

where  Uj(x)  is  a symmetric  function  of  x;  the  constant  c depends 

on  the  a priori  distribution  G.  Following  Karlin  and  Truax  (1960),  c 

may  be  determined  so  that  E(cpg|Hg)  = 1-a;  this  distribution  of 

max  U.(x)  is  needed  for  this  purpose.  For  normal  and  gamma  populations, 
J * 

the  distribution  of  Uj  are  given  in  Doornbos  (1966).  For  nonparametric 
slippage  tests,  the  constant  c can  be  computed  from  direct  calculations. 
In  general  case  the  distribution  has  not  been  worked  out  and  approxima- 
tions are  needed. 
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CHAPTER  II 

A BAYES  APPROACH  TO  SELECTION  OF  t BEST  POPULATIONS 


2.1  Introduction 


Let  be  k independent  populations,  which  have  pro- 

bability densities  f (x,0^) , . . . ,f (x.B^) > respectively,  with  respect 


to  a o-finite  measure  p.  Let  0r,.  < ...  < 0ril  be  the  ordered  0.. 

[1]  - - [k]  i 


For  a given  integer  t,  1 <_  t < k,  the  subset  of  populations  associated 


with  0 ^ t+1] ’ ‘ ' ’6  [k]  be  referred  to  as  the  set  of  t-best 


populations.  In  many  practical  situations  the  set  of  t-best 
populations  is  of  interest.  Multiple  decision  (selection  and  ranking) 
rules  for  the  problem  have  been  considered  in  the  literature  by 
Bechhofer  (1954),  Carroll,  Gupta  and  Huang  (1974),  and  others.  Gupta's 
approach,  which  is  commonly  known  as  the  subset  selection  formulation, 
is  to  select  a subset  S of  random  size  |s|  (t  _<  |s|  £ k)  such  that 
S contains  the  set  of  t-best  populations  with  probability  at  least 


P*,  where  P*  is  a preassigned  constant.  In  Bechhofer 's 

indifference  zone  formulation,  a subset  of  size  t is  chosen  so  that 
the  probability  of  selecting  the  t-best  populations  is  at  least  P* 
whenever  * 6^  tj  ^ <5*,  where  6*  > 0 is  a constant  specified 

in  advance. 


In  this  chapter,  the  problem  of  selecting  a subset  containing 
the  t-best  populations  has  been  considered  from  Bayes  approach.  Deely 
and  Gupta  (1968)  have  considered  the  problem  of  selecting  a subset 
containing  the  best,  which  corresponds  to  t = 1 . It  is  proved  there 
that  under  certain  assumptions  on  f(x,0)  the  Bayes  rule  selects  exactly 
one  population  associated  with  the  largest  observation,  provided  the 
loss  is  'linear  in  0^,  and  the  a priori  distributions  of  0.  are  inde- 
pendent. The  problem  for  general  t is  formulated  in  2.2  and  it  is 
shown  in  Section  2.3  that  if  the  joint  distribution  has  property  M of 

k 

Section  1.2, and  the  a priori  distribution  g(0)  on  on  © is  exchangeable, 
then  for  a loss  function  which  is  linear  in  0^,  the  Bayes  rule  will 
select  exactly  t populations  corresponding  to  the  t largest  observations 
For  t = 1 Bayes  rules  for  non-linear  loss  functions  have  recently  been 
considered  by  Goel  and  Rubin  (1975),  Chernoff  and  Yahav  (1977),  and 
Bickel  and  Yahav  (1977).  In  Section  2.4  we  derive  some  Bayes  rules  for 
selection  of  a subset  containing  the  t best  populations,  when  the 
loss  functions  are  non-linear  in  0^;  normal,  exponential,  Poisson 
and  binomial  populations  have  been  discussed. 

2.2  Formulation  of  the  Problem 


Let  n ,...,1!^  be  k independent  populations  with  densities 
f(x,0j) , . . . ,f(x,0j^) , respectively,  with  respect  to  a o-finite  measure 
p,  0^  fc  © c N (i  = l,...,k).  The  goal  is  to  select  a subset  containing 
the  t best  populations,  i.e.,  the  populations  associated  with  0^  t + 

0-kj , where  0^j  <_  ...  _<  0^  are  ordered  0^. 


1 


The  action  space  U consists  of  J (k)  elements- 

m 

m=t 

u = (S  c {1, . . . ,k}:  | S | > t} 

where  |s|  denotes  the  size  of  the  selected  subset  S. 

The  loss  function  L(0,S):  ©k  x U -*■  IR  gives  the  amount  of  loss 

incurred  in  selecting  a subset  S U,  when  0 € ©k  is  the  true  value 
of  the  parameter. 

The  following  loss  functions  will  be  considered: 


k 

[j.  9 r • -i  y 0 . 

i=k-t+l  ^ jes  3 

t fsr 


4(6, S)  = c | S | 


(2.2.1) 


where  c > 0 is  a known  constant. 


k 

' c)  - -I  s [ I J i = + J ^ fL  •’1  r 
■ ’ --S-  J * V*  s. 


(2.2.2) 


where  0 and  c are  given  non-negative  constants,  I represents  the 

A 

indicator  function  of  a set  A,  and 


S'  = (0. : it  s}' 


(iii)  L3(.0,S) 


= c S ♦ 


Jj‘Vi-1]  * jIb'V] 


(2.2.3) 


where  0jg^  1 ...  _<0g  are  the  t largest  ©i  in  the  subset  S,  c 
is  a known  nonnegative  constant  and 
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' 

We  assume  that 

(i)  f(x,0)  has  monotone  likelihood  ration  in  x and  0 

k 

(ii)  g(0),  the  a priori  distribution  on  6 , is  exchangeable  or 
permutation  symmetric.  That  is, 

g(9i V = g(V-*-’V 

for  all  permutations  i^:  {1,  — ,k}  {l,...,k}. 

(iii)  the  measure  p is  also  symmetric,  i.e., 

dp (x)  = dp(x^). 


j 

j 

1 


Since  only  Bayes  rules  will  be  considered  in  this  chapter, 
attention  can  be  restricted  to  the  class  of  non-randomized  rules 
(see  Ferguson  (1967),  Section  1.8). 


It  is  easily  verified  that  the  loss  functions  L^,  and 
satisfy  monotonicity  and  invariance  properties  of  Eaton  (1967),  and 
hence,  in  each  case,  the  Bayes  rule  d*  for  selecting  a subset 


2.3  Bayes  Rule  for  the  loss  Function  Lj . 

Assume  that  the  loss  in  selecting  a subset  S when  0 is  the 
parameter  is  LjfO.S),  where  is  given  by  (2.2.1). 

Deely  and  Gupta  (1968)  have  considered  the  case  t = 1 with 
loss  function  given  by 


«*•!>  ■ Is  Vetk]  - V- 


It  is  shown  that  under  certain  mild  conditions  on  the  density  f arid 
the  constants  a^,  the  Bayes  rule  selects  only  one  population.  In  this 
section  we  prove  a similar  result  for  the  general  case. 

Let  x.  t 1 be  an  observation  from  it.,  i = l,...,k.  The  posterior 

k 

risk  of  a decision  6:  'X  ■».  L is  given  by 

k 

r(6,x)  = c | S | + F 


r 1 ri 

i=k-t+l  1 
t 


I 6, 

jes 


j 

- I X 


where  rule  <5  selects  the  subset  S with  probability  one. 

It  follows  from  (2.2.4)  that  the  search  for  a Bayes  rule  for 
the  problem  can  be  restricted  to  the  rules  dj (j  = t,...,k). 

Let  0^  denote  the  parameter  values  associated  with  x^.j, 
i = 1 , . . . , k . Then 


r(d  ,x)  - r(d  ,x)  = c + E 
mf  1 - m - 


[ 


IN 

y (e,  . - 0,.  .) 

a=k-m+l  (k-m) 


m(m  + 1) 


b 


] 


We  need  the  following  Lemma: 


Lemma  2.3.1:  If  a density  h(x;e)  has  property  M,  as  defined  in  Section 

1.2,  and  g(0)  is  exchangeable,  then 


E(0(i)  I?)  1 E{-0(j)  I?)  V i > j. 

Proof:  Let  B = {0  £©k;  0^  > 0 }.  Then 

/6k[eii)  ' e(j)]h(^1g(0)d!  = / [e(i)  - e(j)]h(x;e)g(0)de 

+^Bcf0(i)  " e(j-)]h^50)g(01d0 
= / [e(i)  - 0(j)]h(x;0)g(01d0 


Hence 


' 
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Theorem  2.3.1:  If  f(x,0)  has  MLR  in  x and  0,  and  the  a priori  dis- 

k 

tribution  on  e is  exchangeable,  the  Bayes  rule  selects  exactly  t 
populations  associated  with  the  largest  t observations. 

k 

Proof:  Let  h(x;0)  = £ f(x.,6.). 

i=l  11 

It  is  clear  from  definition  that  h(x;0)  has  property  M,  and 
the  result  follows  from  Lemma  2.3.1. 


2.4  Bayes  Rules  for  the  Loss  Functions  L2  and  L^. 


The  Bayes  posteriori  risk  for  rule  dj , when  the  loss  function 


is  L2(0,S)  given  by  (2.2.2),  is 


r(d  ,x)  • H(j)  + f l E(0  r • 1 | x)  t < j < k (2.4.1) 
J " z i=k-t+l  UJ  " 


where 


x) 


k k k — j 

H(j)  - cj  - f-  l E(0  .Jx)  + l l PCe.  = Qf  , |x 
J i=k-j+l  p=k-t+l  q=l  iq’  IPJ 

(2.4.2) 


(2) 

It  is  clear  that  the  Bayes  rule  d in  this  case  is  obtained  by 
minimizing  H(j),  t _<  j <_  k.  Hence 


d(2^  = d if  H(«.)  = min  H(j)  (2.4.3) 

1 t<j<k 

When  the  loss  function  is  Lj(0,S)  given  by  (2.2.3),  the  Bayes 
posterior  risk  in  using  d.  is 


It  follows  from  Lemma  2.3.1  that  A , > A , t < m < k,  and 

m+ 1 — m — — 

f3) 

hence  the  Bayes  rule  d J is  given  by 


if  A = (j:  Aj  _>  0}  is  null  set 

(2.4.5) 

where  SL  = min{i:i  fcA  } if 


A is  non-null 


2.5  Specific  Examples. 

In  this  section,  we  compute  Bayes  procedures  for  the  loss  functions 
L2  and  Lj  in  several  specific  cases. 

1.  Selection  of  t best  normal  means. 

Let  population  ik  be  normal  with  mean  6^  £ (-00,00)  and  a known 
2 

variance  a (i  = l,...,k),  and  let  x^  be  the  mean  of  an 

independent  sample  ,...,x^  of  size  n from  ik;  tk  and 

(i,  j = l,...,k,  i j*  j)  are  assumed  to  be  independent.  We  compute 

the  Bayes  rule  when  the  loss  function  is  L^Ce.S)  given  by  (2.2.2)  and 

v 

the  a priori  distribution  on  ©'  is  k-variate  normal  with  mean 

2 

vector  p = (u,...,u)'  and  covariance  matrix  Z = t 1^  where  1^  is 
the  identity  matrix  of  order  k,  i.e., 


1 
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1c  0 _ 

g(0lt...,9k)  = i r^-  ♦ c— — — )] 

i=l  t 


where  $(x)  is  the  density  of  a standard  normal  random  variable. 

The  posterior  distribution  of  6 = (0^...^)  given  the  ob- 
servation vector  x = (Xj,...^)  is 


k 1 9-  - m. 

g*C0|x)  = n [I  H-v-  *)] 


(2.5.1) 


where 


jo  - 2 

IT  + V 


a 2 
^ + T 


2 a 

T + — 


» i ® 1 , . . . , k 


(2.5.2) 


Let  x jjj  » • • • »x^j  be  ordered  sample  means,  and  let  0^ 
be  the  (unknown)  population  mean  associated  with  x^j,  i = l,...,k. 


E(0(i) I x! * ' ' * ,xk)  = m[i] 


(2.5.3) 


where  m r, , < ...  < m 


[1J  — •**  — are  t^e  ordered  posterior  means. 


We  also  have 


P(0(q)  = 0[p],xr-"’V 


= / P[ 


xl,  max  eCi  ) < y*  min  em>  y v(i  . }c.  K (q) ] 

6 - l<afp-l  1 aJ  j£R(.  1 p-1  1 

3 dGQ  (y) 


(q) 


where 


K(q)  = U, . . . ,q-l,q+l, . . . ,k} 


R(ia)  = RCij.---.ip_j)  = KfqlnUj....,!  }c 


(2.5.4) 


Rx  j°int  conditional  distribution  of  0 given  x,  and  G (:j  is 

the  posterior  cumulative  distribution  function  (cdf)  of  e, 

From  (2.5.1)  we  have 


m 


y " mm 

Gq  (y)=«(  —ill 


U) 


) i 1 = 1 1 • • • 9 k 


(2.5.5) 


where  $(•)  is  the  cdf  of  a standard  normal  random  variable. 
From  (2.5.4)  and  (2.5.5)  we  obtain 


p(ew  " e[p]'xi V 


-/  , I 

(ij» • . . »ip_j}  c K(q)  a=l 


P-1  X[ql  ' X[ia]  2 

I *0*  + ~ ^ o . T2) 


2 2 2 . , 
I-2-(2-  + T2}i 

n n ’ 


l [i  . Hu  . fol  '/HI 


jtRd,) 


2 2 2 3 
T o .0  2.1 

( + T ) * 

n n ' 


t2)] 


. 4>(u)du 


(2.5.6) 
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Using  (2.5.3)  and  (2.5.6),  the  function  H(j)  given  by  (2.4.2) 
can  be  computed  for  each  j,  t <_  j <_  k and  then  the  Bayes  rule  d^ 
can  be  obtained  from  (2.4.3). 

2.  Selection  of  normal  populations  associated  with  t smallest 
variances 


Here  are  independent  normal  populations  each  with 

mean  zero  and  variances  , . . . , respectively.  We  wish  to  select 
a subset  containing  0 [k-t+1] » ■ • • >e  [k]  when  the  loss  function  is 
given  by  (2.2.2)  or  (2.2.3). 

Let  (i  = l,...,k;  j = l,...,n)  be  an  independent  sample 

from  it.  . Then 
i 


2 1 $ 
■ - A 


is  sufficient  for  9. . 

l 

Assume  that  0j » • • • » en  are  independent,  and  6^  has  natural 
conjugate  gamma-2  priori  density[see  p.  54,  Raiffa  and  Schlaifer  (1961)]. 

Then  the  joint  a priori  distribution  of  9 = (<^,...,8^)  is 
k -sfi.a'b'  ia'-l  k 

g(0)  = n (A  e 1 0.  } = nf  (0  j | b ' , a ' ) (2.5.7) 

. . i i = i y *■ 

i=l  11  2 

where  A is  the  normalizing  constant  for  fy  (•  lb',a'),  the  pdf  of  a 

2 

gamma-2  distribution. 

From  Raiffa  and  Schlaifer  (1961)  we  have  the  posterior  dis- 


t 


tribution  of  0 as 


i 2 o K 

g*C01»...,ek|s  ,...,s  ) = n f c®. |b"  a")  (2.5.8) 

i=l  y2  1 1 


where 


a"  = a'+n 

a'b'  + ns? 
b’.'  = - 1 


(2.5.9) 


a' 


LCt  6(k-j+l)  be  the  Parameter  associated  with  s^.,  where 
2 2 J 
S [1  ] - * * * --  s[k]  * Then 


E(0(k-j+l)lsl’'--*sk) 


a'+n 


a'b'  + ns,. . 

[i] 


(2.5.10) 


We  now  have 


H(j)  = cj  - f £ 


a'+n 


k k-j 
+ Z Z 


•j  <->  2 'Aw  4* 

i=l  a'b'  + ns^j  p=k-t+l  q=l 


P(0(q)  = e[i>]lsi»---»sk) 

We  can  compute  P(6fq)  = 6[p] Jsj, . . . ,s£)  by  the  method  indicated 
in  example  1. 

The  Bayes  rule  d^  for  the  loss  function  L2(6,S)  is  given  by 
(2.4.3). 

When  the  loss  function  is  Lj(0,S),  we  need  to  compute 
2 2 

lsi»*  * • »sk^  * denote  the  cdf  of  e.,.,  given 

2 2 _ J 
Sj » • • • j • Then 
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K ■ 
K. 

if ' 


: 


Grkl C0)  = n F (e|b"  a") 
1 J i=l  '2  1 


where  F stands  for  the  cdf  of  a gamma-2  population.  The  function 
'2 

G[^j(9)  can  be  evaluated  by  using  either  of  the  following  relations 
given  in  Raiffa  and  Schlafer  (1961) : 


F (z|b,a)  = Kbz/TT  , -|a  - 1)  = G (-|a|'-^abz) 


where  I(‘,-)  is  the  gamma  function  tabulated  by  Pearson  (1934)  and 
Gp  is  the  cumulative  Poisson  function. 

Now,  since  9^  is  a positive  random  variable,  we  have 


»j,...,s^)  = [1  - Gj-j^-j  (0)]d 


B(e[k)|sf  -2' 

Substituting  the  values  of  E(0 ^ | , . . . ,s£)  and  E(0 | s^, . . . ,s^) 
in  the  expression  for  Am,  we  can  compute  the  Bayes  rule  d^  given 
by  (2.4.5). 


3.  Binomial  populations 

Let  be  k independent  binomial  populations  with 

probabilities  of  success  0.,...,9  , respectively,  0 < 9.  <1. 

1 K X 

Assume  that  0^  are  independent,  and  have  a priori  distribution 

f (*|a,b)  where  f (-|a,b)  is  the  beta  pdf  given  by 
9 p 


fg(0  la»b)  - bCT) 


0a-1(l-9)b'1,  a,  b > 0 


(2.5.11) 


Let  x^  be  the  number  of  successes  in  independent  trials  from 
(i  = l,...,k).  Then  the  posterior  distribution  of  9 = (0 ^ , . . . ,0^) 


given  x = (Xj , . . . .x^)  is 
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g*(e|:0  = n f (e.|x  + a,  n + b - x.) 

i=l  6 1 1 1 


xri  + a 

E(0,.,|x)  = -lil T- 

(i)  1 - n + a + b 

The  Bayes  rule  d^(d^)  for  the  loss  L2(0,S)  (L3(0,S))  can 
be  obtained  as  in  Example  2 discussed  above. 

4.  Poisson  populations 

Here  are  independent  Poisson  distributions  with 

parameters  0j,...,0^,  respectively,  0^  > 0,  i = l,...,k.  Assume 
all  0^  are  independent,  and  0^  has  the  natural  conjugate  a priori 
distribution  f (-ja’.b'),  where 


f (0 |a' ,b')  = Be'eb’0a''1,  0 > 0,  a',b'  > 0 
Y1 


(2.5.12) 


Let  x^  be  an  observation  from  ir^.  Then  the  posterior  distribution 
of  0 = (0j,...,0^)  is  given  by 


g*(9|x)  = n (f  (0  |x.  + a',  1 + b') 

" Y t 1 1 


i=l  ’1 


xr. , + a' 

E(0fi)|x)  = -til 

1 j 1 + b' 


The  Bayes  rule  d^  and  d^  can  be  computed  as  before. 


5.  A selection  problem  in  life  testing. 


Let  *>e  independent  exponential  populations  with  scale 

parameters  0 ,...,0^,  respectively.  Suppose  n items  from  each  of 
the  k populations  are  put  on  a life  test,  and  the  experiment  is  con- 
tinued until  the  first  r failures  from  0^  (i  = l,...,k)  are  observed. 
This  scheme  of  sampling  is  called  Type  II  censored  sampling. 

Let  x^j,...,x^r  denote  the  first  r naturally  ordered  obser- 
vations from  ir.,  and  set 

l 

r 


T. 

l 


j=l 


x. . + (n  - r)x.  , 
ij  ; ir 


1, 


,k 


(2.5.13) 


T^  represents  the  total  accumulated  life  at  the  termination  of 

the  test  on  population  ir.  . The  sample  likelihood  for  tt  , conditional 

1 i 


on  0^,  is 


T. 

ir 

n!  e- 

£(xil,...,xir|0i)  = (n  _ r)!0r  e > 1 = 


Assume  that  0 = (0  ,...,0^)  has  the  following  a priori 
distribution: 


(2.5.14) 


r H v+11 


g(6)  = 


k 

n 

i=l 


ei  Cefi 

e i 


ur(v) 


, 0 < 0^  < “>,  vi,  v > 0 


Then  the  posterior  distribution  of  0 = (0^,...,0^)  given  T = (Tj,...,T^) 


is 


L 


g*(e|T)  = n 

i=H 


U + T. 
1 


u + T. 


r+v+1  1 


0. 

l 


•i) 


^ + TijJ  (r  + vj 


and 


E(0(i)H)  ■ 


n + Tr 


lil 


r + v - 1 


r + v > l 


where  TflJ  1 •••  1 Tfkj  denote  the  ordered  accumulated  life  times, 
^d  ®(i)  *s  t*ie  Para»>eter  associated  with  Tj.j. 

The  Bayes  rules  d(2)  and  d(3)  can  now  be  computed  as  in 
earlier  examples. 
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CHAPTER  III 

ON  SOME  RULES  BASED  ON  SAMPLE  MEDIANS 
FOR  SELECTION  OF  THE  LARGEST  NORMAL  MEAN 

3.1  Introduction 

Many  of  the  classical  statistical  procedures  which  are  super- 
ior to  their  competitors  under  the  normal  model  have  one  drawback: 
their  behavior  is  seriously  affected  if  a few  gross  errors  are 
present  in  the  sample.  For  example,  consider  the  problem  of  point 
estimation  of  the  mean  0 of  a normal  population.  It  is  well  known 
that  the  sample  mean  is  a uniformly  most  powerful  unbiased  estimate 
of  6,  but  is  not  a very  good  estimate  if  some  wild  observations  are 
present  in  the  sample.  Hodges  and  Lehmann  (1963)  have  proposed  a 
class  of  estimates  which  are  based  on  rank  test  statistics;  these 
estimates  are  approximately  normally  distributed  for  large  samples. 
Gupta  and  Huang  (1974)  have  Investigated  selection  procedures 
based  on  one-sample  Hodges-Lehmann  estimates  of  location  for  the 
problem  of  selecting  a subset  containing  the  largest  t (l<jt«k) 
location  parameters,  when  the  sample  size  is  large.  Gupta  and 
Leong  (1977)  have  discussed  a selection  rule  based  on  sample 
medians  for  double  exponentials.  In  this  chapter  we  consider 
selection  procedures  based  on  sample  medians  for  normal  populations 
For  the  problem  of  selectiag  a subset  containing  the  largest 
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normal  mean,  when  the  means  are  equally  spaced,  a rule  based  on 
medians  Is  compared  to  the  rule  proposed  by  Gupta  (1965) , which  Is 
based  on  sample  means.  It  appears  from  the  numerical  work  done  that, 
as  expected,  Gupta's  rule  is  superior.  This  may  be  due  to  the  fact 
that  the  tails  of  a normal  distribution  are  not  very  thick,  and 
hence  the  probability  of  getting  extreme  observations  is  small.  In 
case  the  underlying  distributions  have  thicker  tails,  e.g., 
logistic,  double  exponential  [see  Gupta  and  Leong  (1977)],  extreme 
observations  are  more  frequent  and  have  a serious  affect  on  the 
sample  means,  but  not  on  the  sample  medians.  In  these  situations, 
sample  median  should  be  a better  estimate  of  location  than  the 
sample  mean,  as  indicated  by  the  fact  that  for  a double  exponential 
population,  the  sample  median  has  smaller  variance  than  the  sample 
mean,  if  sample  size  n » 7 [see  Chu  and  Hotelling  (1955)]. 

Apart  from  having  a simpler  form  for  its  density  function,  the 
sample  median  as  an  estimate  of  location  has  some  other  advantages 
over  the  sample  mean.  Intuitively  any  reasonable  estimate  of  location 
should  have  a distribution  which,  in  some  sense,  is  centered  on  the 
true  location  parameter  value.  It  is  shown  by  Hodges  and  Lehmann 
(1963)  that  the  sample  median  has  a dlstlbutlon  which  is  symmetric 
about  the  true  value  if  the  underlying  distribution  is  symmetric, and 
in  case  the  underlying  distribution  is  not  symmetric,  the  sample 
median  is  a median  unbiased  estimate  of  location, 1. e. , the  median  of 
its  distribution  coincides  with  the  true  location  parameter. 
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In  Section  3.2  we  Introduce  the  notations  to  be  used  In  this 
chapter.  In  Section  3.3  we  Investigate  procedures  based  on  sample 
medians  for  selection  of  a subset  containing  the  largest  normal  mean. 
In  Section  3.4  the  problem  of  selecting  a subset  containing  all  nor- 
mal populations  better  than  a control  Is  discussed,  and  a rule  based 
on  sample  medians  Is  proposed.  In  Section  3.5  the  selection  rule  pro- 
posed In  Section  3.3  is  compared  to  Gupta's  procedure  based  on  sample 
means  [c.f.,  Gupta  (1965)],  when  the  means  are  equally  spaced. 

In  Section  3.6  selection  rules  based  on  medians  of  large  samples 
are  discussed.  In  Section  3.7  asymptotic  relative  efficiency  (ARE) 
of  the  proposed  rule  with  respect  to  Gupta's  rule  for  selecting 
a subset  containing  the  largest  normal  mean  Is  computed,  when  the 
means  are  in  slippage  configuration.  It  is  also  shown  that,  in  case 
the  normal  populations  are  contaminated,  the  proposed  rule  based  on 
sample  medians  improves  on  the  rule  based  on  sample  means  in  terms 
of  ARE. 

In  Section  3.8  a test  of  homogeneity  based  on  sample  medians 
is  proposed,  and  a relation  between  the  test  and  selection  rule  of 
Section  3.3  is  established.  In  Section  3.9  the  distribution  of  a 
statistic  useful  in  some  selection  and  ranking  problems  is  derived. 

3.2  Preliminaries  and  Notations 

Let  X,,...,  X.  ,,  be  independent  observations  from  a population 
1 Zm-ri 

with  cumulative  distribution  function  (cdf)  F(x,0)  and  probability 
density  function  (pdf)  f(x,0),  x,0  e IR,  the  real  line  and  m>l. 

Then  the  sample  median  ft  is  given  by 


* " 1Wi 


where  X[;Lj<  ...  < X[2m+1] 
The  pdf  of  ft  is 


are  ordered  X^. 


g(x,e)  - cm  [F(x,e)]m  [i-F(x,e)]m  f(x,e) 


where  c - 

“ (m!)2 

and  Its  cdf  is  given  by 


(3.2.1) 


G(xfi  ) = c / * [F(u,6)]m  [1-F(u,0)]m  f (u,0)  du 


m 


rF(x,0)  m 


0<y<l 


cm  '0  u «-*>  du  = IF(x>0)  (o+l,  o+l)  p-7(,  (3.2.2) 


where  Iy  (p,q)  is  the  incomplete  beta  function: 

! (v  Q)  » . r7  „P-1  M-u)^-1  du 

y lp,q;  r(p)r(q)  •'o  u u u;  du 


(3.2.3) 


The  following  result  from  Karlin  (1968)  will  be  needed: 


Lemma  3.2.1:  If  f(x,0)  has  monotone  likelihood  ratio  (MLR)  in  x and 
0,  then  g(x,0)  given  by  (3.2.1)  has  MLR  in  x and  0. 

3.3  A Procedure  Based  on  Sample  Medians  for  Selecting  a Subset  Con- 
taining the  Largest  Normal  Mean 

Let  , . . . , be  k independent  normal  populations  with  means 

0^,...,  9^  and  a common  known  variance  o2.  Let  ft^  be  the  sample 

median  of  n » 2m+l  (m>J.)  observations  from  it^.  The  pdf  g and  the 

cdf  G of  ft^  are  obtained  from  (3.2.1)  and  (3.2.2)  by  substituting 

F(x,01)  - f f(x,0)  - ■)  . and  are  given  by 

s<*.V  ■ r 


(3.3.1) 
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C(x,  01)  «=  It(  r9!)  (nri-l.nrt-l) 


where  #(•)  is  cdf  and  ^(*)  is  pdf  of  a standard  normal  random 


variable. 


0[1]=  ”*=  ® [k]  k®  t*ie  ordered  For  the  problem  of  select- 
ing a subset  containing  the  population  assodiated  with  0^,  consider 
the  following  rule  : 

V select  ni  iff  ^ > fc[k]  - d±a  (3.3.3) 

where  < ...<  ^[k]  are  or^eret^  sample  medians,  and  d^>.  0 is  a 

, * 1 * 
constant  depending  on  a preassigned  P (^<  P < 1)  such  that 


P(CS | Rx)  > P 


(3.3.4) 


Let  be  the  sample  median  associated  with  (i=l,...,k). 

p(cs I V " p<*(k)  £*[k)  -di  °> 

“ P$(J)  < 5f(k)  + dx,  j-l,..,k-l) 


'P(5!(j)  ' e[j]  =X(k)  ‘ 6 [k]  + 0[k] 


0 [ j ] "*■  V • • *k~l) 


— /°°  J kii1  I 

a /—  j-x  * (iri- 


9 r,_  i " ® i 


+ dx) 


(nri-l,nrt-l) 


(u)  [1-*  (u)  ]°  (ji  (u)  du 


It  is  clear  from  the  above  expression  that  the  infimum  of  P(CS  | R^ 
occurs  when  all  are  equal,  and  hence  constant  d^  K d^  k,n,P*)  is 
obtained  by  solving 


~ 
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c 

T ll9  (u^)  ]k_1  Au)  [l-«(u)]m 


’<Ku)  du  = P 


(3.3.5) 


Expected  Size  of  the  Selected  Subset 

The  rule  defined  by  (3.3.3)  and  (3.3.4)  selects  a random 
number  of  populations  and  it  is  desirable  that  the  size  of  the  select- 
ed subset  be  small,  and  also  the  ranks  of  the  selected  subset  be 
large,  where  the  population  associated  with  6^  is  given  rank 

(i=l k) . Gupta  (1965)  has  proposed  the  expected  size  of  the 

selected  subset  and  expected  sum  of  ranks  of  selected  populations 
as  criteria  of  efficiency  of  selection  rules. 

Let  S and  SR  be  random  variables  denoting  the  size  of  the 
selected  subset  and  the  sum  of  ranks  of  the  selected  populations, 
respectively.  Then  following  Gupta  (1965)  we  have 


and 


E (S|R  ) - l P (i) 

e i=i  - 


E0  (SR|R1)  = l i P0(i) 
— i=l  — 


(3.3.6) 


(3.3.7) 


where  P0  (i)  is  the  probability  that  the  rule  selects  the  popu- 
lation associated  with  e^j,  i=l,...k,  and  is  given  by 


V1’  ■ f I. 


- k 


,-e, 


^ Vui  [1]g  + v (nrt‘1*mfl) 


•m(u)  [l-$(u)]m  <f>(u)  du 


(3.3.8) 
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Some  Properties  of  R_ 


(i)  Upper  Bound  on  E0 (S | R^) 

It  follows  from  Lemma  3.2.1  that  g(x,6  ) given  by  (3.3.1) 
has  MLR  in  x and  6.  Hence,  from  Theorem  1 of  Gupta  (1965),  ve  have 

Max  Efi(s|R  ) - Max  Efi(s|R.)  - kP* 

0*e  - 1 QtQ  - 1 

- 0 

where  0Q  = {(01 0fc)  e 0:  02  - ...  - 0fc}. 

(ii)  Property  of  Monotonicity 

It  is  clear  from  expression  (3.3.8)  that  the  rule  Rj^ 
is  strongly  monotone  [see  Santner  (1975) J,  i.e.,  P0(i)  is  non- 
decreasing (nonincreasing)  in  0^  <e[j]»  j**1)*  when  other  compo- 

nents of  j}  remain  fixed.  The  following  two  properties  of  R^  are 
immediate  consequences  of  strong  monotonicity: 

(a)  Monotonicity:  P0(i)  > P0(j)  V l<j<i^c 

(b)  Unbiasedness:  P^(k)  >,  P0(i)  V l<J.<k 

(iii)  Minimaxi ty  with  Respect  to  the  Expected  Subset  Size 


A selection  rule  R*  is  said  to  be  minima x with  respect 


to  S if 


sup  E (S|R*)  - inf  sup  Efl(S |r) 

0 - R 0 - 

where  inf  is  over  all  selection  rules  which  satisfy  the  P*-condition 
[see  Berger  (1977  ]. 

The  density  function  g(x,©i)  given  by  (3.3.1)  is  clearly 
of  location  type,  and  it  has  MLR  in  x and  01  by  Lemma  3.2.1.  It  fol- 
lows from  Theorem  1.4.2  of  Berger  (1977)  that  the  selection  rule 
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^ is  minimax  with  respect  to  S among  all  rules  based  on  sample 
medians. 

3.4  Selection  of  a Subset  Containing  All  Normal  Populations 
Better  Than  a Control 

Here  we  have  k+1  independent  normal  populations  . . . .rr^, 

where  ir^  has  mean  0^  and  a known  variance  a (i«l,...,k). 

The  population  irQ  is  a standard  or  control  population;  0^  may  or 
may  not  be  known.  The  population  it  is  said  to  be  better  than  control 
if  6^  Our  interest  is  in  the  subset  of  all  populations  which  are 

better  than  control.  Gupta  (1965)  has  investigated  a rule  based  on 
sample  means  which  selects  a subset  that  contains  all  normal  popula- 
tions better  than  control  with  probability  at  least  P*.  We  propose 
a rule  based  on  sample  medians.  Two  cases  are  possible: 

A.  0q  known 

We  are  given  medians  ^ of  n ■ 2m  + 1 (m>  1)  independent 

observations  from  n (i-l,..,k).  Coasider  the  rule  R.  defined  as 
1 A 

follows: 

Ra:  select  tt±  iff  ^ > 0Q  - aa  (3.4.1) 

whete  a is  chosen  so  as  to  satisfy  the  basic  P condition.  Let  k^,  k£ 
denote  the  true  (unknown)  number  of  populations  with  0^  0Q  and 
9i  < V re8Pectlvely-  Then  + k£  ■ k.  Also  let  primes  refer  to  the 
k^  populations  which  are  better  than  control.  Then 

kl 

P(CS|R.)  - n P(X’>0n-ao) 

•A  1.1  i *=  0 


It  Is  clear  from  (3.4.2)  that  the  minimum  of  P(CS|r^)  under  the 
restriction  0^>0Q  occurs  at  0j-  0Q  , 1-1,... 1^,  and  therefore  a 
lower  bound  on  P(CS|ra>  Is  obtained  by  setting  0'-  0 and  k -k: 


Hence  a is  obtained  from  the  equation 


The  expected  subset  size  for  the  rule  R 


E(S|RA)  * l P(iri  is  selected) 


In  this  case  2m+l  independent  observations  are  taken 
from  ir^.  Let  be  the  median  of  this  sample.  We  propose  the  follow- 
ing rule 


select  v.  iff 


where  constant  b is  chosen  to  satisfy  P condition.  We  have,  as  in 


(m+l,nrt-l)  ] 


c k. 

a t 00 


P(CS|V  - f J_  y U-i  60  -e; 


i*l  # (u+ 


- b) 


•4^(11)  [l-$(u)  ]m  <p(u)  du 


i -?/-  n-V-bi  <-».**•«  r 


•4>m(u)[l-  $(u)]m  4>(u)  du 


(3.4.6) 


(3.4.7) 


The  constant  b is  obtained  by  equating  right  hand  side  of  (3.4.7)  to 

it 

P . The  expected  subset  size  for  the  rule  Rg  is  obtained  as  in  Case  A. 
Remarks : 

(i)  It  is  clear  from  expressions  for  P (select  6^  for  rules 

R.  and 
A 

P(select  0^  >_  P(select  0^)  if  0^  0^ 


Rg  that,  in  either  case 


(ii)  If  e. 


in  each  case. 


= l,...,k  and  0Q  is  finite,  then  E(S)  K 


3.5  Comparison  between  R^  and  Gupta's  Procedure  Based  on  Sample 
Means  when  the  Normal  Means  Are  Equally  Spaced 

Let  tt^  be  k independent  normal  populations  with  means 

2 

0,0  + 6a,..., 0 + (k-l)6a  and  a common  known  variance  a ;6>  0 is  a 
known  constant.  Let  X±J  (j«l,...,n)  be  a sample  of  size  n»2mH(m»l) 
from  ir^  (i*»l,...,k),  and  let  X^,  X^  be  the  median  and  the  mean  of 


observations  from  it^: 
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" *I  *fl]  Wh*r*  *[1J  “ •*-  x[2nrfl] 


2m+l 

I X 

J-l  U 

2mfl 


For  the  problem  of  selecting  a subset  containing  the  largest  mean 
0+(k-l)6o,  Gupta  (1965)  has  proposed  the  following  rule  R: 


R:  select  it.  iff  X.  > Xr,  . ^ — 

1 1 " lkI  /1ST 


(3.5.1) 


where  d>P  is  given  by 


l 1 (u+d)  <j,(u)  du  * P 


(3.5.2) 


We  will  compare  the  rule  R to  the  rule  R1  defined  by  (3.3.3)  and 
(3.3.5). 

Let  P(i,k,P  ,6  ,n|R')  denote  the  probability  with  which 
a rule  R'  selects  the  population  associated  with  the  i-th  largest 
mean  (i=>l k) . Then  from  Gupta  (1965)  we  have 

P(i»k,P*, 6 ,n|R)  - P(i,k,P *,«  Jn  |r) 

-/  [ U *(x+d-(j-i)6v£)  ]<f>(x)  dx]  (3.5.3) 

j-1 

The  expression  for  P,(i,k,P*,6  ,n|R^)  is  obtained  from  (3.3.8) 
by  substituting  0^j  = 0+(i-l)6o  , 1*1,..., k: 


P(i,k,P*,6,n|lL)-  -2.  I [ n I , fmflnH.ni 

i+i 


>*m(u)[l-*(u)]m  (u)  du 


(3.5.4) 


For  the  rule  R^  the  probabilities  P(l,k,P*,5,n|R1)  given  by 

(3.5.4)  have  been  confuted  for  k-2(l)5,  n-3,5,  6-O.J  (0.5)  5.0 

* 

and  P « -90,. 95.  The  numerical  Integration  was  done  by  the  Gauss- 
Hermlte  Integration  formula.  Tables  are  given  at  the  end  of  this 
chapter  .Tables  of  P(i,k,P?6,a|R)  are  available  in  Gupta  (1965). 

Next,  let  Y(k,P*,6,n|R’)  and  ^(k.pfd.nlR')  denote  the 
expected  sum  of  ranks  and  expected  average  rank  of  the  selected 
populations  for  a rule  R' , respectively.  Then 


(3.5.5) 


For  the  rule  R,  tables  of  values  of  y and  the  expected 
proportion  of  the  population  retained  in  the  subset  (»  ^ E(s|r)  ) 
are  available  In  Gupta  (1965) . We  have  computed  the  values  of 
these  functions  for  the  rule  R^,  for  values  of  k,n,<5  and  P*  men- 
tioned above.  For  instance  if  P*-.  90,  k-5,  n-3,  6-1 .5/v/3~  , 
then  the  rule  R based  on  sample  means  selects  the  second  best  and 
third  best  populations  with  probabilities  .781  and  .357,  respective- 
ly. The  corresponding  probabilities  for  the  rule  ^ are  .822  and 
.467,  in  that  order.  The  probability  of  correct  selection 
(selecting  the  beet)  has  to  be  greater  than  .90  for  both  the 
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rules  and  Is  actually  equal  to  .998  for  the  rule  R,  and  .997 
for  the  rule  R^. 

It  appears  from  these  tables  that  the  rule  R based  on  sample 
means  is  superior  to  the  rule  R^  based  on  sample  medians,  and  that, 
a6  expected  , the  performance  of  R relative  to  Rj^  improves  Aft  sample 
size  is  increased. 

Remarks : 


' 


(i)  It  is  shown  in  Gupta  (1965)  that  the  rule  R has  the 
following  desirable  properties: 

(1)  For  fixed  P*  and  k 

P(l,k,P*,6,n)  + 6 /n 

and 

P(k,k,P*,6,n)  + 6 

(2)  For  fixed  P ,i,6  and  n 
P(i,k,P*,6,n)  + k,  lii<Jt 

(3)  For  fixed  k,P  and  (i-j)6 
lim  » (P*,k,6,n)  - k 

n-H» 

It  is  clear  from  expression  (3.5.4)  that  the  rule  Rj^  has  similar 
properties. 

(ii)  It  follows  from  (3.5.4)  and  (3.5.5)  that 


* c k 

(A)  Y(P  ,k,6,n|R  )>  -S  J if 

° i-1 


k 

tj"2I*(x+d1-(j-I)6) 


(m+l,mfl)  ] 


♦“(x) [l-*(x)]“  $(x)  dx 
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(B)  y(P  ,k,6,n|R1)> 


k(k+l)c 


*/-«>  ^I*(x+d1-(k-l)6)  (nH‘1»m+1>]k  1 
•$m(x)[l-$(x)]m  *(x)  dx 

3.6  A Selection  Rule  Based  on  Medians  of  Large  Samples 

Let  f (x)  be  the  pdf  of  a continuous  random  variable  X and  let 
C be  its  unique  median.  Let  be  the  sample  median  of  n=2mfl  obser- 
vations on  X.  The  distribution  of  X under  certain  conditions  on 
f (x)  is  known  [see  Cramer  (1946)]  to  be  asymptotically  normal  with 


mean  £ and  variance 


4[f(C)]  (2m+l) 


The  above  result  will  be  used  to  investigate  a rule  for  selecting 

a subset  containing  the  largest  normal  mean. 

% 

Let  be  the  sample  median  of  (2m+l)  independent  observa- 
tions from  (i=l,...,k),  where  is  normal  with  mean  0^  and 
a known  variance  a2.  Then,  for  large  n,  is  asymptotically 


normal  with  mean  6^  and  variance 


2 

wo 

2 (2m+l) 


For  the  problem  of  selecting  a subset  containing  6 we  propose 
the  following  rule: 


R^:  select  iff  ft  -d2<J 


(3.6.1) 


2(2m+l) 
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where  is  to  be  determined  from  the  basic  P condition. 


We  have 


p(cs|r2)  - P(3E  (k)  >*[k]-d2 


2(2mfl) 


k-i  0 ri  ,-e . 
= / 00  [ n $(u+  — I 
i=l  ay it 


+ t^)  ] d>  (u)  du 


/2(2nrfl) 


(3.6.2) 


It  follows  from  (3.6.2)  that  the  equation  for  d2  is 


/ '''(u+d,)  <J>  (u)  du  = P 


(3.6.3) 


Tables  of  values  of  d2  satisfying  (3.6.3)  for  selected  values  of  P 
are  available  in  Bechhofer  (1954)  for  k = 1(1)10  and  in  Gupta  (1956) 
for  k = 1(1)50. 


Expected  Size  of  the  Selected  Subset 


E(S I R_)  = 7 P (population  associated  with  0 , , is  selected) 

1 1=1  L U 


k k 0 r.  r 

= y / “ IT  4>(u+  L 

i=l  j-1  QvV 

/2(2mfl) 


+ d2)  d> (u)  du 


(3.6.4) 


3.7  Asymptotic  Relative  Efficiency  (ARE)  of  the  Rule  R_  Relative 
to  Gupta's  Rule  R 


(i)  Normal  Populations 


Let  ir^,...,  be  normal  populations  with  means  0^,...,0^, 

2 

respectively  and  a common  known  variance  o . Assume  that  0^(i=l,..,k) 
are  in  foil owing  slippage  configuration: 


79 


9i  = i 


0+0 A if  i=i  ; A>0  is  unknown 
o 

9 if  if«i 


The  population  ir  has  the  largest  mean  and  is  called  the  'best' 
o 

population;  the  index  iQ  is  not  known. 

Our  interest  is  in  the  relative  performance  of  the  following 
two  selection  rules: 

d„o  /rT 

R„:  select  tt.  iff  X.  >.  max  X. 

2 1 1 T<j<k  J /2n 


R : select  it  iff  X.  > max  X. 

~ l^j<k  j /T 


where  is  the  median  and  X.  the  mean  of  n=2m+l  observations 
l l 

from  i=i,...,k,  m^l.  The  constants  d and  d ^ are  given  by 
equations  (3.5.2)  and  (3.6.3),  respectively.  It  is  obvious  that 


Let  S be  the  number  of  non-best  populations  in  the  selected  subset. 

•k 

Then  small  values  of  S are  desirable,  and  therefore,  consistent 
* 

with  the  basic  P -condition,  we  would  like  to  keep  the  expected 

•k 

value  of  S as  small  as  possible. 

It  is  intuitively  clear  that  the  performance  of  any  reasonable 

election  rule  R'  should  improve  as  the  sample  size  is  increased. 

r > zlven  0<e<l,  let  N , (e)  be  the  number  of  observations  needed 

R 


EfS  V)  “ e 


« that 
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We  will  use  the  following  definition  of  ARE  [see  McDonald 
(1969)]: 


Definition  3.7.1:  The  ARE  of  the  rule  R^  relative  to  the  rule  R 
is  defined  to  be 


ARE(R2,R;0  ) = lim 


NR(e) 


(3.7.2) 


e+0  N (E) 
R2 


We  have 


k-1 


E (S  |R2) 


l > k[k]  - d2  -wj  > 

fa 


k-1 


■ L P^(j)  = ^(i)+d2  g|/~*  > j=1 k»  j^i) 

1 fa 


!(k *(« £ + d ) $k'2(u+d.)  <Ku)du 

/rr  * z 


fa 


By  definition  of  N (e)  we  have 


(k-1 )/_“  *(u- 


/n 

fa~U) 

R2 


+ dj)  <I>k  2(u+d2)  <f>(u)  du  = e 


(3.7.3) 


Similarly 


E(S  |R)  - (k-1)  / ^ <t(u-  +d)  4>k  2 (u+d)  (u)  du 

7I7n 


and  therefore 


(k— 1)  / 4>(u- 

• — 00 


/l/NR(e) 


+d)  $ (u+d)  4>  (u)  du  = e 


(3.7.4) 


Since  d2=d,  (3.7.3)  and  (3.7.4)  lead  to 


/ ” [ $( u ^ + d)  - Hu  - 

* — OO  f - 


^/2N  (€) 

R2 

1 ^ ^ (u+d)  4>  (u)  du  = 0 


/l/NR(e) 


+ d)] 


(3.7.5) 


Using  the  fact  that  $ is  strictly  increasing,  it  can  be  seen  from 
(3.7.5)  that 


2NR^(e)  NR(e) 


and  hence 


ARE  (R  ,R)  = lim  N.(e)/N_  (e) 
2 e+0  R R2 


(ii)  Contaminated  Normal  Populations 


(3.7.6) 


Suppose  in  the  course  of  sampling  from  population  tt^ 
(i=l,...,k)  something  happens  to  the  system  and  gives  rise  to 
wild  observations.  In  these  situations  the  pdf  of  can  be  written 


f(x,9i)  = af1(x,9i)  + (1-a)  f2(x,9i),  0<a<l 


(3.7.7) 
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This  means  that  the  experimenter  is  sampling  from  a population  with 
pdf  f^(x,0^)  100  a percent  of  the  time,  and  from  f2(x,0i)  with 
100  (1-a)  percent  of  the  time  (i=l,...,k).  The  presence  of  obser- 


vations from  f2(x,0^)  is  termed  as  contamination.  For  our  discussion 
we  will  assume  that 

fi(x*V  = 5 

L_  . (X~9i  X 


where 


f.(x,0  ) = 

2 1 /be  /b 


(i=l, . . . ,k) 


0 + Ac 


if  i=i  (i  is  unknown), A>  0 unknown 
o o 

if  i^i 


Given  n independent  observations  from 

, let  3!^  be  the  median  and  the  mean  from  ik,  i=l,...,k, 

where  n>3  is  an  odd  integer.  It  is  known  [see  Rohtagi  (1976)] 

that  ^ and  both  are  asymptotically  normal,  each  with  mean  0^  and 
%2  —2 

variances  a and  a , respectively,  where 


[ 

? 


-v  2 
a 


- 2 
a 


7T0 


2n 


{a+ 


(3.7.8) 


sr 


[a  +(1-  o)b] 


(3.7.9) 


For  the  problem  of  selecting  a subset  containing  the  (unknown) 


best  population  , consider  the  following  two  rules: 
o 

R*  : select  ir^  iff  £ 5^]  - dj  o 


* — — * — 
R : select  iff  X^  >.  X^j  ~ d o 
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It  is  easy  to  see  that  and  d*  both  satisfy  the  equation 

(3.5.2)  and  hence 

3*f  ★ 

d^  = d = d,  say. 

Then 

E(S  | R2 ) = (k-1)  j $(u-  + d)  4>k  (u+d)  <p(u)  du 

a 

(3.7.10) 

and 

E(S*|R*)  = (k-1)  / ” $(u-  ^£.  + d)  $k~2  (u+d)  <J>(u)  du 

a 

(3.7.11) 

Equating  the  right  hand  sides  of  (3.7.10)  and  (3.7.11)  to  e and 
solving  for  NR*(e)  and  NR^(e)  we  get 

2 2 

1 _ a [a+  (l-ci)b) 

) 

2_  [a+  (1-a)  b]  [a+  ^-1 2 

" Ah' 

-+  <x>  as  b ■+■  * . 

This  shows  that  for  large  values  of  b the  rule  R*  based  on  medians 
is  much  better  than  the  rule  R based  on  sample  means.  In  fact, 
it  can  be  seen  from  a result  in  Rohtagi  (1976)  that  the  ARE  (R*,R) 
is  close  to  1 when  b=9  and  a=. 915,  and  as  the  differences  b-9>0 


\.<c) 


NR*  <e) 
R2 
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7* 

or  .915  -ot>  0 increase  the  rule  shows  a significant  improvement 


over  R in  terms  of  the  ARE. 


3.8  A Test  of  Homogeneity  Eased  on 


Let  be  k independent  normal  populations  with 

means  0^,...,  0^,  respectively,  and  a common  known  variance  o^ 
As  before,  let  denote  the  sample  medians  of  n=2m+l  (m^l) 
observations  from  rr^  (i=l,...,k),  and  j , . . . be  their 

ordered  values.  For  the  hypothesis 

H : 0 , = . . .=0, 
o 1 k 


consider  the  following  test: 

Reject  H if  R=$ri  , > Y 

o [k]  [1] 

We  find  the  constant  y so  that 

PH  ( R>Y  ) < cx 

o 

where  a is  the  size  of  the  test. 


The  following  result  gives  the  constant  y,  and  also  establishes  a 
relationship  between  the  test  given  by  (3.8.1)  and  the  selection 
rule  R^  of  Section  3.3. 


Theorem  3.8.1 


For  0<ot<l,  let  y satisfy 

\ <*k  i *[10  - T)  i !-  f 


PH  (R  >Y)  4 a 

o 


Proof:  The  proof  is  similar  to  that  of  Theorem  6.1  of  Gupta  and 
Leong  (1977),  and  hence  omitted. 

3.9  On  the  Distribution  of  the  Statistic  Associated  with  R 
when  the  Underlying  Distributions  Are  Normal 

'V/ 

Let  X±  (i=0,l, . . . ,k)  be  k+1  sample  medians  of  n=2nri-l  (m:>l) 
independent  observations  from  a standard  normal  population. 

Define 

\ = \ ~ \ (i=l, . . . ,k) 

The  r.v.’s  Z±  are  correlated  and  the  distribution  of  Z = max  Z is 

l_<i<k  1 

needed  in  some  ranking  and  selection  problems.  For  standard 
double  exponential  population  the  distribution  of  Z has  been 
computed  by  Gupta  and  Leong  (1977)  for  selected  values  of  k,n,  and  a. 
In  this  section  we  give  an  expression  for  the  distribution  of  Z, 
and  also  provide  a short  table  for  its  upper  percentage  points 
for  P*=a  =.75, .85, .90, .95, .99,  k=2(l)5,  n=3(2)ll. 

Let  F( • ) be  the  cdf  of  Z.  Then  F(z)  =P(Z<  z)=P(l<5(o+z,i=l,..,k) 

It  is  easy  to  see  that 

*00-  cm  I -I  (^(z+x)  Ax)  [l-4>(x)  ]m  4»(x)  dx 

(3.9.1) 

Computations  for  upper  percentage  points  of  F were  done  using 
Gauss-Hermite  quadrature  based  on  20  nodes  for  the  numerical 


integration. 
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TABLE  IA 

For  the  rule  R and  the  configuration  ( 0,  0 + $CT  0 + (k-1)  g0) 

this  table  gives  the  probability  of  selecting  the  normal  population 
with  rank  i when  the  population  with  mean  0 + (i-l)6a  has  rank 
i , i=l,2, . . . ,k;the  common  variance  02  is  assumed  to  be  known. 

P*  - .90.  n - 3 


5/n 

.5 

1.0 

1.5 

2.0 

2.5 

3.0 

3.5 

4.0 

4.5 

5.0 

k i 
2 1 

.836 

.749 

.643 

.524 

.404 

.291 

.196 

.123 

.072 

.039 

2 

.944 

.971 

.986 

.994 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

3 1 

.778 

.585 

.359 

.171 

.061 

.016 

.003 

.000 

.000 

.000 

2 

.882 

.828 

.745 

.640 

.521 

.400 

.288 

.194 

.121 

.070 

3 

.959 

.983 

.993 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

4 1 

.704 

.381 

.118 

.019 

.001 

.000 

.000 

.000 

.000 

.000 

2 

.818 

.648 

.423 

.217 

.084 

.024 

.005 

.001 

.000 

.000 

3 

.907 

.865 

.793 

.697 

.583 

.462 

.344 

.240 

.156 

.094 

4 

.969 

.989 

.996 

.998 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

5 1 

.612 

.193 

.019 

.001 

.000 

.000 

.000 

.000 

.000 

.000 

2 

.740 

.425 

.142 

.024 

.002 

.000 

.000 

.000 

.000 

.000 

3 

.845 

.688 

.467 

.251 

.102 

.031 

.007 

.001 

.000 

.000 

4 

.923 

.887 

.822 

.733 

.624 

.504 

.384 

.274 

.183 

.113 

5 

.975 

.992 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

TABLE  IIA 

For  the  rule  ..  R and  the  configuration  ( 0,0  + 0 + (k-1)  ) 

this  table  gives  the  expected  average  rank  of  the  selected  subset  (top) 
and  the  expected  proportion  of  the  populations  selected  in  the  subset 
(bottom)  when  the  normal  population  with  mean  0 + (i-1)  has  rank 
i , i=l,2, . . . ,k; the  common  variance  is  assumed  to  be  known. 

P*  = .90  . n = 3 


.5 

1.0 

1.5 

2.0 

2.5 

3.0 

3.5 

4.0 

4.5 

5.0 

k 

2 

1.361 

.890 

1.345 

.860 

1.307 

.814 

1.256 

.759 

1.199 

.701 

1.145 

.645 

1.098 

.598 

1.061 

.562 

1.036 

.536 

1.019 

.519 

3 

1.806 

.873 

1.730 

.799 

1.610 

.699 

1.481 

.603 

1.367 

.527 

1.272 

.472 

1.193 

.430 

1.129 

.398 

1.081 

.374 

1.047 

.357 

4 

2.234 

.849 

2.057 

.721 

1.831 

.582 

1.634 

.483 

1.479 

.417 

1.358 

.372 

1.261 

.337 

1.180 

.310 

1.117 

.289 

1.071 

.274 

5 

2.639 

.819 

2.323 

.637 

1.995 

.489 

1.745 

.401 

1.561 

.346 

1.422 

.307 

1.311 

.278 

1.220 

.255 

1.146 

.237 

1.091 

.223 
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TABLE  IB 

For  the  rule  R^  and  the  configuration  ( 0,  0 + to  ,...,0  + (k-l)fio  ) 
this  table  gives  the  probability  of  selecting  the  normal  population 
with  rank  i when  the  population  with  mean  0 + (i~l)6a  has  rank 
i , i=l,2, . . . ,k; the  common  variance  a ^ is  assumed  to  be  known 

P*  , p = 3 


6/n 

.5 

1.0 

1.5 

2.0 

2.5 

3.0 

3.5 

4.0 

4.5 

5.0 

i 

2 1 

.910 

.850 

.768 

.665 

.548 

.427 

.312 

.213 

.136 

.080 

2 

.974 

.988 

.995 

.998 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

3 1 

.871 

.718 

.500 

.277 

.118 

.037 

.009 

.001 

.000 

.000 

2 

.938 

.902 

.842 

.758 

.653 

.535 

.414 

.301 

.204 

.129 

3 

.982 

.993 

.998 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

4 1 

.784 

.477 

.173 

.033 

.003 

.000 

.000 

.000 

.000 

.000 

2 

.875 

.732 

.517 

.292 

.126 

.041 

.010 

.002 

.000 

.000 

3 

.940 

.909 

.851 

.770 

.668 

.551 

.430 

.315 

.216 

.138 

4 

.982 

.994 

.998 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

5 1 

.742 

.306 

.043 

.002 

.000 

.000 

.000 

.000 

.000 

.000 

2 

.842 

.565 

.235 

.052 

.006 

.000 

.000 

.000 

.000 

.000 

3 

.914 

.798 

.602 

.369 

.176 

.063 

.016 

.003 

.000 

.000 

4 

.961 

.938 

.894 

.828 

.739 

.631 

.512 

.391 

.281 

.188 

5 

.990 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

TABLE  I IB 

For 

the  rule  R • 

and  the  configuration 

( 0,  0 

+ 6a  , 

....  0 

+ (k-l)6a  ) 

this 

; table  gives 

the  expected  average 

rank  of 

the  selected 

subset 

(top) 

and 

the  expected  proportion  of 

the  populations  selected  in 

the  subset 

(bottom)  when  the 

normal 

population  wi 

th  mean 

0 + (i-l)6a 

has  rank 

i , 

i-1,2. 

...,k;the  common  variance  a 

z is  assumed  to  be  known. 

* 

P 

.=  -9?  , 

n = 3 

ft/n 

.5 

1.0 

1.5 

2.0 

2.5 

3.0 

3.5 

4.0 

4.5 

5.0 

k 

2 

1.429 

1.413 

1.379 

1.330 

1.273 

1.213 

1.156 

1.107 

1.068 

1.040 

.942 

.919 

.881 

.831 

.774 

.713 

.656 

.607 

.568 

.540 

3 

1.898 

1.834 

1.725 

1.597 

1.474 

1.369 

1.279 

1.201 

1.136 

1.086 

.930 

.871 

.780 

.678 

.590 

.524 

.474 

.434 

.401 

.376 

4 

2.321 

2.161 

1.938 

1.731 

1.565 

1.434 

1.327 

1.237 

1.162 

1.103 

.895 

.778 

.635 

.523 

.449 

.398 

.360 

.329 

.304 

.284 

5 

2.792 

2.514 

2.178 

1.904 

1.699 

1.543 

1.419 

1.315 

1.225 

1.150 

.890 

.721 

.555 

.450 

.384 

.339 

.306 

.279 

.256 

.238 

For  the  rule  R and  the  configuration  (9,  9 + 6a  , ...,9  + (k-l)6a  ) 
this  table  gives  the  probability  of  selecting  the  normal  population 
with  rank  i when  the  population  with  mean  9 + (i-l)6CT  has  rank 


i=l,2, . . . ,k; the  common  variance 

.90 


* 

P = 


is  assumed  to  be  known, 
n = 5 


6/n 

.5 

1.0 

1.5 

2.0 

2.5 

3.0 

3.5 

4.0 

4.5 

5.0 

k i 
2 1 

.838 

.754 

.653 

.539 

.422 

.312 

.216 

.139 

.084 

.047 

2 

.943 

.969 

.985 

.993 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

3 1 

.782 

.596 

.379 

.191 

.073 

.021 

.005 

.001 

.000 

.000 

2 

.882 

.832 

.753 

.652 

.538 

.422 

.311 

.215 

.139 

.084 

3 

.958 

.982 

.993 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

A 1 

.710 

.400 

.134 

.024 

.002 

.000 

.000 

.000 

.000 

.000 

2 

.822 

.657 

.442 

.239 

.098 

.031 

.007 

.001 

.000 

.000 

3 

.907 

.868 

.799 

.708 

.599 

.483 

.368 

.264 

.177 

.110 

4 

.967 

.988 

.995 

.998 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

5 1 

.620 

.214 

.025 

.001 

.000 

000 

.000 

.000 

.000 

.000 

2 

.746 

.444 

.161 

.031 

.003 

.000 

.000 

.000 

.000 

.000 

3 

.848 

.698 

.486 

.274 

.119 

.039 

.010 

.002 

.000 

.000 

4 

.922 

.890 

.828 

.743 

.639 

.525 

.408 

.299 

.205 

.131 

5 

.974 

.991 

.996 

.999 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

TABLE  IIC 

For  the  rule  R and  the  configuration  ( 9,  9 + 6a  ,...,9  + (k-l)6o  ) 
this  table  gives  the  expected  average  rank  of  the  selected  subset  (top) 
and  the  expected  proportion  of  the  populations  selected  in  the  subset 
(bottom)  when  the  normal  population  with  mean  9 + (i-l)6a  has  rank 
i , i=l,2, . . . ,k; the  common  variance  ^ is  assumed  to  be  known. 

P*  = .90  . n = 5 


6/n  .5 

1.0 

1.5 

2.0 

2.5 

3.5 

MB 

5.0 

k 

2 1.361 

.890 

1.346 

.862 

1.311 

.819 

1.262 

.766 

1.208 

.710 

1.155 

.655 

1.107 

.608 

1.069 

.569 

1.042 

.542 

1.023 

.523 

3 1.807 

.874 

1.735 

.803 

1.621 

.708 

1.495 

.613 

1.382 

.537 

1.288 

.481 

1.209 

.439 

1.144 

.405 

1.093 

.380 

1.056 

.361 

4-  2.236 

.851 

2.068 

.728 

1.850 

.593 

1.654 

.492 

1.499 

.425 

1.378 

.378 

1.280 

.344 

1.198 

.316 

1.132 

.294 

1.083 

.278 

5 2.643 

.822 

2.342 

.647 

2.019 

.499 

1.770 

.409 

1.583 

.352 

1.443 

.313 

1.332 

.284 

1.240 

.260 

1.164 

.241 

1.105 

.226 

- 
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TABLE  ID 

For  the  rule  and  the  configuration  ( 0,  0 + 6a  ,...,0  + (k-l)6cy  ) 
this  table  gives  the  probability  of  selecting  the  normal  population 
with  rank  i when  the  population  with  mean  0 + (i-l)6cr  has  rank 
i , i=*l,2, . . . ,k;  the  common  variance  a^  is  assumed  to  be  known. 

* 

Z - .95  , n = 5 


6/n 

.5 

1.0 

1.5 

2.0 

2.5 

3.0 

3.5 

4.0 

4.5 

5.0 

k i 
2 1 

.912 

.854 

.776 

.678 

.566 

.449 

.337 

.236 

.155 

.095 

2 

.974 

.987 

.994 

.998 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

3 1 

.875 

.729 

.521 

.304 

.137 

.047 

.012 

.002 

.000 

.000 

2 

.939 

.905 

.848 

.769 

.670 

.557 

.441 

.328 

.230 

.150 

3 

.981 

.993 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

4 1 

.824 

.543 

.230 

.053 

.006 

.000 

.000 

.000 

.000 

.000 

2 

.901 

.778 

.582 

.359 

.174 

.064 

.018 

.004 

.001 

.000 

3 

.953 

.928 

.881 

.811 

.721 

.615 

.499 

.383 

.277 

.187 

4 

.986 

.995 

.998 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

5 1 

.753 

.335 

.054 

.003 

.000 

.000 

.000 

.000 

.000 

.000 

2 

.849 

.585 

.264 

.065 

.009 

.001 

.000 

.000 

.000 

.000 

3 

.917 

.809 

.622 

.399 

.202 

.078 

.023 

.005 

.001 

.000 

4 

.962 

.941 

.899 

.837 

.754 

.652 

.539 

.422 

.311 

.215 

5 

.989 

.997 

.999 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

TABLE  IIP 


For  the  rule  R and  the  configuration  ( 0,  0 + 6a  ,...,0  + (k-l)6a  ) 
this  table  gives  the  expected  average  rank  of  the  selected  subset  (top) 
and  the  expected  proportion  of  the  populations  selected  in  the  subset 
(bottom)  when  the  normal  population  with  mean  0 + (i-l)6a  has  rank 
i , i*l,2, . . . ,k; the  comnon  variance  is  assumed  to  be  known. 


* 


P = 

•9? 

6/n 

.5 

1.0 

1.5 

2.0 

2.5 

3.0 

3.5 

4.0 

4.5 

5.0 

k 

2 

1.429 

1.414 

1.382 

1.336 

1.282 

1.224 

1.168 

1.118 

1.078 

1.047 

.943 

.921 

.885 

.838 

.783 

.724 

.668 

.618 

.578 

.547 

3 

1.899 

1.839 

1.736 

1.613 

1.492 

1.387 

1.298 

1.220 

1.153 

1.100 

.932 

.876 

.789 

.690 

.602 

.535 

.484 

.444 

.410 

.383 

4 

2.357 

2.216 

2.007 

1.801 

1.629 

1.493 

1.383 

1.289 

1.208 

1.140 

.916 

.811 

.673 

.556 

.475 

.420 

.379 

.347 

.319 

.297 

5 

2 . 799 

2 . 536 

2.208 

1.935 

1.728 

1.569 

1.445 

1.340 

1.250 

1.172 

. 894 

.733 

.568 

.461 

.393 

.346 

.312 

.785 

.262 

. 24  3 

ILP^i 
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* TABLE  III 

Upper  100(1  - P ) percentage  points  of  Z = max  (X.  - X ) where 

lSi&  1 

Xq>X^,...,X^  are  iid  sample  median  random  variables  in  samples 


of  size  n = 2mfl 

( m > 1 ) 

from  the  standard 

normal  < 

distribution. 

kn  ^ 

5 

7 

9 

11 

.638 

.511 

.445 

.409 

.393 

.980 

.784 

.676 

.614 

.582 

1 1.213 

.969 

.832 

.751 

.710 

1.558 

1.245 

1.065 

.956 

.900 

2.208 

1.766 

1.522 

1.398 

1.491 

.959 

.768 

.667 

.610 

.579 

1.276 

1.019 

.876 

.792 

.744 

2 1.493 

1.192 

1.019 

.915 

.855 

1.816 

1.452 

1.239 

1.105 

1.030 

2.429 

1.943 

1.676 

1.533 

1.606 

1.125 

.901 

.783 

.715 

.675 

1.432 

1.142 

.980 

.884 

.828 

3 1.642 

1.310 

1.117 

1.000 

.931 

1.854 

1.563 

1.333 

1.184 

1.099 

2.551 

2.040 

1.761 

1.609 

1.671 

1.235 

.989 

.859 

.784 

.738 

1.536 

1.223 

1.048 

.945 

.883 

4 1.742 

1.389 

1.182 

1.057 

.982 

2.049 

1.639 

1.396 

1.238 

1.145 

2.634 

2.106 

1.819 

1.661 

1.715 

For  given  k,  n and  P = .75  (top),  .85  (second),  -90  (third), 
.95  (fourth),  .99  (bottom),  the  entries  in  this  table  are  the 
values  of  d which  satisfy 

J Gk(x+d)  g(x)  dx  = P 

where  G(.)  is  the  cdf  and  g(.)  the  pdf  of  the  median  of  a 
sample  of  size  n from  a standard  normal  population  ; n > 3 
is  an  odd  integer. 


CHAPTER  IV 


ON  SELECTION  OF  POPULATIONS 
CLOSE  TO  A CONTROL 


4.1  Introduction 

Let  tTq,  tt  k be  (k+1)  independent  populations  with 

densities  f(x,eQ),  fte,^) fCx,^),  respectively,  0i  e 0 C IR 

(i*0,l, . . . ,k) . The  population  is  a control  population.  Let 
E=  [a(0Q),  b(eQ) ] be  an  interval  in  IR,  and  let  D=  {e^, . . . , } 

Then  the  set  D can  be  partitioned  as  follows: 

D = D1 U D2  , where  Dj  * D HE  , D2  = D O Ec 

The  subset  of  all  populations  with  parameters  in  is  of  interest 

in  many  practical  situations.  The  problem  of  selection  of  a 

subset  containing  all  populations  better  than  a standard  corresponds 

to  E=(6g,®).  Gupta  and  Sobel  (1958)  have  considered  this  problem  for 

normal,  gamma  and  binomial  populations  and  have  investigated  a 

procedure  for  selecting  a subset  which  contains  all  populations 

* * 

better  than  a standard  with  probability  at  least  P , where  0<P  <1 
is  a preassigned  constant.  W.T.  Huang  (1975)  has  derived  a Bayes 
rule  for  the  problem  of  partitioning  a set  of  k normal  populations 
with  respect  to  a control. 


I 
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In  Section  A. 2 we  consider  the  case  E=[A+0  , B+0  ] where  A 

o o 

and  B(-°°<A<B<00)  are  known  constants.  The  problem  of  selecting 

a subset  containing  populations  with  parameters  in  [A+0  , B+0  ] 

o o 

arises  in  many  situations.  For  example,  in  dealing  with  matching 
parts,  one  is  interested  in  populations  which  are  close  to  a 
control.  A Bayes  procedure  has  been  derived  for  the 

above  problem  when  the  underlying  distributions  are  normal.  The 
problem  has  also  been  considered  from  the  subset  selection  approach 
and  modified  forms  of  rules  proposed  by  Gupta  and  Sobel  (1958) 
for  selection  of  normal  and  gamma  populations  have  been  investigated 
in  Section  4.3. 

In  the  next  two  sections  we  discuss  a slightly  different 

problem.  The  goal  in  this  case  is  to  select  the  subset  of  a given 

size  t (l<t£k)  of  populations  associated  with  d ^ j , . . . ,d ^ ^ , 

where  d.  B d(0, ,0  ) is  a distance  function  and  dr,,  <...<  dr,  , 
i 1 o'  [1]  - - [k] 

represent  the  ordered  d^.  In  Section  4.4  , we  have  derived  Bayes 

procedures  for  selection  of  t populations  associated  with  t 

2 

smallest  values  of  d(0.-0  )=(0  -0  ) ; normal,  binomial,  Poisson 

i o i o 

and  exponential  populations  are  discussed. 

In  Section  4.5  the  problem  of  selecting  the  t populations 
closest  to  control  has  been  considered  from  the  Empirical  Bayes 
approach;  Poisson,  geometric  and  binomial  populations  have  been 
discussed.  Empirical  Bayes  subset  selection  rules  for  the  problem 
of  selection  of  largest  or  smallest  parameter  have  been  investigated 
by  Deely  (1965). 
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A. 2 A Bayes  Rule  for  Selecting  All  Normal  Populations  Close  to 
Control 


h 


Let  ^o’^l* " ” ,1Tk  be  k+1  independent  normal  populations  with 

means  ^i  e ® Ik)  and  a common  known  variance  cr. 

We  say  that  tt±  is  "close"  to  irQ  if  A+6  <0^  B+© Q,  where  A and  B 

(-“<A<B<“  ) are  given  constants.  The  goal  is  to  select  a subset 

containing  all  populations  close  to  JIq. 

The  action  space  ^consists  of  all  subsets  of  {l,...,k}, 

k+1 

including  the  null  set.  Assume  that  the  loss  function  L:0  x_Q^»TR 


is  given  by 


L(i-S>- 1 V«0.b*v  «i>  ♦ ls  . 


[A+e0,B+e0]c  (ei} 
(A. 2.1) 


where  1^(0  is  the  indicator  function  of  a set  D. 

The  mean  0^  of  population  iTq  may  or  may  not  be  known.  We 
consider  the  two  cases  separately. 

(1)  Oq  known 

Let  xi^»****xin  be  an  independent  sample  of  size  n 
from  ir^(i=l, . . . ,k)  . Suppose  the  a prior  L distribution  g(£)  of 
£«(0^,...,  0^)  is  given  by 

k 0±-W  s 

8(e)-  n [ ± } 1 , 

i=i  T 

where  $ is  the  density  of  a standard  normal  random  variable. 

Then  the  posterior  distribution  of  0^  is 
* k . Q . ~ m . 

g (o)  - n [ i 

i«i  v 


i i 

I ! 


i / 


I 


: V.  ’S 
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where  and  v are  given  by  (2.5.2), 


For  a = A + 0^,  b = B + 6^,  let 


S1  = {i  • ®±  e [a,b],  V (mi)  >_ 


where  H'(y)  = $ (^  ~^)  — $ (^~  ^) , — <*>  < y < 


(4.2.2) 

(4.2.3) 


Let  be  the  rule  which  selects  with  probability  one.  We  show 
that  d1  is  a Bayes  rule  for  the  problem.  As  we  are  considering 
only  Bayes  rules,  it  suffices  to  show  that 


r(d1,x)  <_  r(d,x)  for  any  nonrandomized  rule  d (4.2.4) 


where  r(d,x),  the  Bayes  posterior  risk  in  using  the  rule  d,  is 
given  by 


r (d,x)  = Y ¥(m.)  + £ [1  - ¥(m.)] 

i$S  ieS  1 


(4.2.5) 


Let  a rule  d choose  a set  S (g^S^)  with  probability  one.  We  show 


that  r(d,x)  >^r(d^,3t).  The  following  cases  need  to  be  considered: 
(i)  There  exist  i and  j (1  < i,  j < k)  such  that 


ie  SCD  S1  , j e S O sj 


Let 


S’  = (i.j)  S 


where  (i,j)S  denotes  the  subset  obtained  from  S by  replacing 
j by  i.  Letting  d’  denote  the  rule  which  selects  the  set  S'  with 
probability  one,  we  have 


r(d,x)  - r(d',x)  = 2 ( yOr^)  -V  (m^)) 


id 


where  ¥(•)  is  given  by  (4.2.3). 

We  can  see,  as  in  Section  1.7,  that  the  function  V(y) 
is  symmetric  about  , and  strictly  decreases  with  |y- 

Since  nu  e [a,b],  nu  $ [a,b],  we  have 


and  hence  T(m^) 


> V(m  ). 


Thus 


r(d,x)  - r(d',x)  = 2 (J^)  - ¥(■  ))  > 0 
Since  S.^  can  be  obtained  from  S by  the  operation  used  in  (i) , the 
inequality  (4.2.4)  follows. 

(ii)  SCS1 

It  is  easily  seen  from  (4.2.5)  that 


r(d,x)  - r(d1,x)  = £ 


c [2  Y(m.) 
ieSnS;L  1 


- 1]  > 0 


by  definition  of  S^. 
(iii)  S D S, 


In  this  case  we  have 


(d,x)  ~ r(d  ,x)  = l fl  — 2V(m  )]  >_  0 


by  definition  of  S^.  It  follows  that  d^  is  a Bayes  rule  for  the 


problem. 
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(2)  0q  unknown 

Here  we  are  given  samples  xi:L,...,xin  from  all 

2 

k+l  normal  populations,  (i=  ,l,...,k);  ct  is  assumed  to  be 
known.  Suppose  that  the  a priori  distribution  of  £=  (Gq.G^ . . . ,efc) 


k Q ,-U 

s(eo,0i’ * * * ~ n I r <K  i~)l 

i=0  T 


(4.2.6) 


Then  g (60»91 9k) » the  posterior  distribution  of  £ given 


the  observations,  is 


* k 0 -m 

g (0  ,0  ,...,G  ) = n [i 

u j.  K i=0  v v 


(4.2.7) 


where  v and  are  given  by  (4.2.2). 

We  wish  to  find  a Bayes  rule  for  the  problem  of  selecting 
a subset  containing  all  population  for  which  G^e [A+Gq.B+Gq] , 
when  the  loss  function  is 


(i’S)  " i\s  1fA+e0»B+e()]  (Q±)  + Js  [1~I[a+9q»b+g0] (0i^ ^ 


Let  S^Urm^mQt  [A,B]  , * (n^  - n^)  > 1/2)  (4.2.8) 

where  ♦ is  same  as  ♦ given  by  (4.2.3) .with  v replaced  by  v/2. 
Also, let  d2  be  the  rule  which  selects  the  subset  defined  by 


* * - ~ ; 


■ 


■ 


■ 
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(4.2.8)  with  probability  one. Then  using  the  fact  that  the  poster- 

2 

ior  distribution  of  0J-0Q  is  NG^-iHq.Zv  ) we  can  show,  as  in  Case  (1), 
that 


r(d2  ,x)  1 r (d , x) 

i.e.,  d2  is  the  Bayes  rule  in  this  case. 

4.3  Selection  of  All  the  Populations  Close  to  Control  from 
Subset  Selection  Approach 


Let  ir^  be  k independent  populations  with  densities 

^(x.e^) f(x,0k)  (xeIR;  0^e0  C IR,  i=l k),  respectively. 

Also  let  Hq  be  a control  population  with  density  f(x,0o).  Let 
d:GH-  IR  be  a function  which  measures  the  distance  of  a point  0e0  from 
0q  • We  will  say  that  the  population  ir^  is  close  to  if  0£d(  ^)<_  a. 
In  this  section  we  will  propose  and  study  procedures  to  select  a 
subset  which  contains  the  set  {ir^:d(0^)£a}  with  probability 

•k 

at  least  P . 

Location  Parameter-  Normal  Populations  with  Common  Known  Variance 

2 

Let  ir^  be  normal  with  mean  0^  and  variance  a (i=0,l, . . . ,k) , 
and  define 

d(0A)  - I 01-OO I 

Case  A.  0^  known 

A sample  of  size  n^  is  taken  from  (1-1,..., k).  Let  x^ 

be  the  sample  mean  from  tt^.  For  selecting  a subset  containing  all 
populations  ir^  with  a*  consider  the  following  rule  R^: 


select  tt.  iff 


e _ i2*  < - < + d^a 

60  r-  1 Xi  1 90  + ^== 


(4.3.1) 
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The  constant  d is  chosen  to  satisfy 


p(cs|ra)>p*  , 0 < P*<1 


where  CS  stands  for  correct  solution,  i.e.  the  selection  of 


^ with  IS^-Ogl^a.  Let  and  k2  denote  the  true  number  of 
populations  with  | l^_a  and  1 0±-eo  | >a,  respectively,  so  that 
k^+k2=k.  If  we  let  primes  refer  to  values  associated  with  k^ 
populations  with  | l^.a,  then 

kl 

P(CS|Ra)  “ n P(0O-  ~ lx'<0 o+—  ) (4.3.2) 

i=l  ^ ^nj 

ki  (6  -e-)v^  (9  o-ep  ^ 

= n [$(— ~ -+ad)-*(  i --ad)] 

i=l  a a 


where  f(*)  represents  the  cdf  of  a standard  normal  random  variable. 


Consider  the  function 


(0n-u)»(nT  (9«-u)  >^rr 

'Hu)  = 4>(  — — - — - + ad)  - t>(  — ^ — - — ~ - ad) 


(4.3.3) 


We  can  verify  that 

(i)  lim  ¥(u)  = 0 = lim  Y(u) 


(ii)  f(0Q+u)  = ¥(0q-u),  i.e.,  the  function  ¥(u)  is  symmetric 


about  u=0q. 


(ill)  a-  Y(„)  . [♦( 


- ad)  -<K 


(e0-u)^ 


+ ad)  ] 


> 0 if  u<0( 
* 0 if  u»0. 


<0  if  u>0. 


Hence  the  function  Y(u)  is  increasing  if  u < eQ  , decreasing  if 
u > 6Q  and  attains  its  maximum  at  u-6Q.  It  follows  that 


inf  ’t'(u)  = V(0  -a)  = Y(0  -H») 

|u-0olfa  0 0 

Hence 


kl  ►'n  J 

infp(CS  |r  ) = n [4>(a(— - + d))  - $(a( — --d))] 

i=l  ° a 

k *^7  ✓nT 

> n [$(a(— - +d))  - *(a( — - - d))] 


The  constant  d is  obtained  by  equating  the  right  hand  side  of  the 
above  inequality  to  P . For  unequal  sample  sizes,  computation  of 
d is  difficult.  If  n^=n  V i=l,...,k,  we  have 

P(CS|R  ) > [*(a(-^+  d))  - Ha{-  - d))]k 

n a o 

and  hence  d can  be  found  by  solving 


+ d))  - 0(a(  -d)  -(P*)1^ 


Case  B.  0^  unknown 

In  this  case  observations  from  all  the  k+1  populations  are 
taken.  Let  x^  be  the  sample  mean  of  n^  observations  from  ir^ 

(i~ 1 » » • . » k) . Consider  the  following  selection  rule: 

V select  iff  xQ  - ^ + — 


(4.3.5) 


( y > 0 ) 


( y < 0 ) 


Figure  1.  The  Function  t(0) 


It  is  clear  from  Figure  1 that 


inf 

®ie{ le±“e0l-a} 


p(0o+a) 

L^e^a) 


if  y < 0 

If  y > 0 


From  (4.3.6)  and  (4.3.8)  we  have 

kl 

inf  P (CS | R_)  - / ® H f(e  «,,)♦(,)  dy  + 

<|o;-o0l<«>  i-1  ° 

k 

. 1 

+ L n y(0o-afy)f(y)  dy 
u i-1  u 


f o k r •'57 

/ JI  «t  — y + a{D-  — i)) 
1-1  ^ 


_ ^ i /n  " 

y - a(D  + — i))]  (f> (y)  dy 

0 


, <»  k r *^7 

+ / 0 nr*  (—  y + a(D  + — i ) 

0 **  s 


- *(— : y - a(D  (y)  dy 

/no 
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(4.3.9) 


Since  the  function  on  the  right  hand  side  of  (4.3.9)  increases 
from  0 to  1 as  d ranges  from  0 to  »,  the  equation  for  D-D(P*)  can 
be  solved  for  any  0 £ P*  < 1. 

Scale  Parameter-  Gamma  Populations  with  Known  Shape  Parameters 
Here  (i  = 0,1,..., k)  has  density 


g(  x ; 01,  ot) 


9 _cti^2 

-* *(V2>-  1 e-*/6i 

°1 

r t /)) 


(4.3.10) 


where  x,  0^,  > 0,  and  are  known  constants.  Let 
denote  the  corresponding  cdf.  In  this  case  we  say  that 
is  close  to  Vq  if 


G(x;  0lt  a A) 


1 

8 


< 
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where  3 > 1 is  a given  constant. 


Case  A.  0Q  known. 


T.  re- 
consider the  rule  R^:  select  it.  iff  -1  [_Ji  , 3eQc] , 


where 


n. 

l 

T.  = l 

1 j-1 


X.  . 
ij 


vi  = ni°i  , i = 

and  c > 1 is  chosen  so  as  to  satisfy  the  P*-condition. 

T. 

Using  the  fact  that  the  cdf  of  -i  is  G(t)  * G(t;l,v.)  we  obtain 


1 eeocvi 


6Vi 


p(cs,R;)  = [G(-^L_;l,v.)  - G(-^-;l,v.)]  (4.3.11) 


1 0- 

Here,  as  before,  k.  is  number  of  populations  it.  with  - < — < r 

i r x b — e_  — ’ 

0 

and  the  primes  refer  to  values  corresponding  to  the  populaitons  close 
to  TTg.  Define 


86  cv!  36  v.' 

Hi (6i)  - G(-^;l,v.)  - GC-^ijl.v .) 
1 1 


(4.3.12) 


It  is  easy  to  verify  that  the  function  ^(0!)  is  increasing  in  6£  if 


1.  reQ,  and  hence 


1 en 

inf  P(CS|R!)  = n H. (-£) 


iv  S’1 


<0 <36- 
3 - l 0 


i=l 
k 0 


> n H.  (— — ) 

— i 3 ’ 


i=l 


Hence  the  constant  c is  given  by  the  following  equation: 
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n [G(3  cv.  ;l,v.) 
i=l  1 1 


Case  B.  0^  unknown. 


In  this  case,  consider  the  rule  R' 

D 


3T  T.  BT-C 

select  it.  iff  < — < — — 

i v C - v.  - v„ 

0 l 0 


where  (i  = 0,1,..., k)  are  as  defined  in  case  A,  and  C > 1 is 


constant. 


Then,  as  in  case  A,  we  have 


e2cv. 


P(CS|R'  ) Li  n [G(  — 1 u;l,v  ) - G(-g-p — u;l,v  )] 
B 0 i=l  v0  1 L 0 1 


•g(u;l,v0)du 


The  constant  C can  be  obtained  by  equating  the  right  hand  side  of 
(4.3.14)  to  P*. 


Application  to  Selecting  Variances  of  Normal  Populations 


Let  7K  be  a normal  population  with  mean  and  variance  of 

(i  = 0,1,...,  k) . We  will  say  that  is  close  to  ttq  if 
0 

5 < < 3 where  0.  = 2 o?  (i  = 0,1,..., k)  and  6 > 1 is  a known 

8 0O  ii 

constant. 


"7T 
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For  the  case  where  the  ik  (i  = 0,1,..., k)  are  known,  we  consider 

a rule  based  on  S.  = £ (X..  - u.)2/n*  which  is  sufficient  for  a?. 

j=l  13  1 1 1 

If  Op  is  known,  the  rule  is  : select  iff 


2^a0  2 2 
— j— — < S7  < 2$do„ 
a — 1 — 0 


Using  the  fact  that 


n.S2 
1 1 

0. 

1 


1 < d 


1 2 

is  distributed  as  =xn  random-variable,  we 


2An. 
1 


can  show  that  the  equation  for  d is  the  same  as  (4.3.14)  with  = iu. 
If  the  means  (i  = 0,  l,...,k)  are  unknown  and  m > 1 (i  = l,...,k), 
then  we  use  x^  in  place  of  and  n.!  - 1 for  m.  The  constant  d is 
given  by  (4.3.14)  with  v.  * n.  • 1. 


4.4  Bayes  Rules  for  Selection  of  t Populations  Closest  to  Control. 


Let  7Tq,  *!»•  ••»*]<  be  k+1  independent  populations  with  density 
function  f(x,eQ),  f (x.Gj) , . . . ,f (x,6k) , i = 0,  l,...,k.  The  goal  here 
is  to  select  a subset  S of  a given  size  t (1  <_  t < k)  from  (l,...,k) 
such  that 


max  (6  - 0 )2  _<  (e  - e )2  , X i £ S 

j€S  1 u 1 u 

k 

The  action  space  U consists  of  ( ) elements: 


Q = (S  c (l,...,k)  : | S | = t) 


where  |s|  denotes  the  size  of  a subset  S. 
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I 


A reasonable  loss  function  L:  ©k+1x  a R in  this  si 


situation  is 


L(?'S)  = i€S  t(6j  " 6°)2  ' ,mi\  f9i  ' V2]  f4-4-1) 

Jtb  l<i<k  u 


We  will  derive  Bayes  rules  in  several  examples. 

1*  Normal  populations  with  common  known  variance. 


Let  tt±  be  normal  with  mean  6..  and  a known  variance  o2 
i = 0,  l»...,k.  The  mean  0Q  of  the  control  population  uQ  may 
or  may  not  be  known.  We  discuss  the  two  cases  separately. 


A.  0g  known 


Let  (j  = 1» • • • »n)  be  an  independent  sample  of  size  n from 

= l»...,k),  and  the  mean  of  the  sample,  i.e., 

n 

l x.  . 


- _ j=l 

xi  n * 


Assume  that  the  a priori  distribution  of  0 = (0  ) is 

“1  k 

k 

i e,  ■ w 
g(0)  = n [i 

i=l  T T 

where  <#> Cx)  is  the  pdf  of  a standard  normal  random  variable.  Then  the 
posterior  distribution  of  0 given  (x ...... x ) is 

A Iv 

k 

= iSi  ^ ♦("i  v %] 

where  nu  and  v are  given  by  (2.5.2) 
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The  Bayes  posterior  risk  in  choosing  a subset  S € U is  given  by 

.2  . 2.  i 

^|x] 

(4.4.2) 


•(S,x)  - l E[{(0  - 0 )2  - min  (0.  - 0_) 2) | x] 

j€S  3 0 Ki<k  1 0 - 


where  x = (x^ , . . . ,x^) . 

It  follows  from  (4.4.2)  that  a Bayes  rule  for  the  problem  is  to 
select  the  subset  S*  defined  by 


S*  = (j:  l E{(0, 

jfcS*  3 


Simple  calculation  gives 


0o)2|x} 


min  l E{ (0 . - 0j2|i) 

seujes  3 u " 


(4.4.3) 


E[(0j  - 90)  |x]  = D ^ + v 2 (4.4.4) 

2 

Let  Dfij  1 •••  1 denote  the  ordered  . It  is  clear  from 


where  D.  = (M.  - 9 ) 
J J 0J 


(4.4.3)  that  Bayes  rule  will  select  the  subset  of  populations  associated 
with  j i • . . } • 

B . 0Q  unknown 

Let  be  the  mean  of  n independent  observations  from  ir^ 

(i  = 0,  l,...,k)  and  assume  that  the  a priori  distribution  of 

(®qi  92,****®]()  ts 

Jl  . 0.  - v 


a i ♦<- 


-) 


8 (®0*  » • • • » ®^) 


T 
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As  in  Case  A we  have 


E[(0j  - eo^2|xo’  *!»•••»*]<]  = Dj  + 2v2  where  Dj  = (m^-m0)‘ 
and  therefore  the  Bayes  rule  selects  the  populations  associated  with 

{ D [ j ] » — * D *[t  j } * 

2.  Selection  in  terms  of  variances  of  normal  populations 


Here  it.  is  normal  with  mean  zero  and  variance  — (i  = 0,  l,...,k) 

i 

A.  0q  known 

Mere  xi j » • * - » xin  an  independent  sample  from  it.  (i  = l,...,k). 


2 1 r 2 

S.  = - > X. . 

1 n 


i = 1 , . . . ,k 


Assuming  that  0 = has  a priori  distribution  g(8)  given 

by  (2.5.7)  we  have,  as  in  example  2 of  Section  2.5, 


E{(0.  - 0o)2|s2,...,s2} 


i?T  - Vi')2  2eo2 

(bV)2(^+l)  2+a" 


where  a",  b'/  are  given  by  (2.5.9) 

Hence  the  Bayes  rule  will  select  the  populations  associated  with 


t smallest  D.,  where 
J 


1 

Dj  = by 
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B.  0g  unknown 

In  this  case  a sample  xQ1,...x0n  is  taken  from  t,q>  and  let 

9 n 2 

S0  = x0j’  Let  the  a Priori  distribution  of  (0  , 0 ) 

j — 1 " 1 K 


g(®0*  = 


n f(e.|b*fa») 


i=0  y2  1 


Then,  as  in  case  A,  we  have 


C f / a A \2|  2 2 2 1 2 + 21** 

EL(0  - e ) s0>  s ,...,sk]  = 

J I a"(bvr 


2 2 + a" 

bjb0  a"(b|j)2 


r2  + a"  2 

la"b'.'  " b"  ^ 

3 0 


2 + a" 


2 ♦ a"  a" 
a"  ” 2 + a" 
2 

/i  • » -v  6 


Thus  the  Bayes  rule  will  select  the  populations  associated  with  t smallest 


D!,  where 
J 


D.  = I ^ + a"  1 1 

i [“""j  *rj 


3 Binomial  Populations 


Here  we  are  given  irj,...tirk  with  probabilities  of  success 
®q»  0j*"'->0k»  respectively.  As  before,  we  consider  two  cases: 


A.  0q  known. 


Let  be  the  number  of  successes  in  n independent  trials  from 


no 


^ (i  = l,...,k).  Assume  0^  are  independent,  and  each  has  the  natural 
conjugate  beta  prior  with  parameters  a and  b,  given  by  (2.5.11).  Then, 
conditional  on  x = (Xj,...^),  0's  are  independent,  with  having  a 
beta  distribution  with  parameter  (x^  + a)  and  (n  + b - x^),  and  it  is 
easy  to  show  that 


E[(e.  - e0)2|x]  = 


(Xj  + a)(Xj  + a + 1) 

(n  + a + b)  (n  + a + b + 1) 


2Vxj  - 

n + a + b 


= [Xj  + a + i - (n  + a+  b + 1)©0] 2 + (n  + a + b + l)eQ(l  - 6Q)  - 

(n  + a + b)  (n  + a + b + 1) 


* 


2 

Setting  Dj  = [x^  + a + ^ -(n  + a + b + 1)®q]  , we  see  that  the 
Bayes  rule  selects  the  populations  associated  with  t smallest  . 

B.  0q  unknown 

In  this  case,  n independent  trials  are  performed  on  nQ  also; 
let  xQ  be  the  number  of  success  from  Then,  if  0^,  0^,...,0^  are 

assumed  to  be  independent,  each  with  natural  conjugate  beta  prior 
f (-|a,b),  we  have,  as  in  case  A, 


Ill 


where 


D!  = [x.  + a + i - 
J J 2 


n + a + b + 1 , .,2 

n * a ♦ b (x0  * a)l 


Hence  the  Bayes  rule  selects  populations  associated  with  the 


t smallest  D! • 
J 


4.  Poisson  populations 


Here  is  Poisson  with  parameter  0i  (i  = 0,  l,...,k). 


A.  0^  known 


Assume  are  independent,  and  each  has  the  natural  conjugate 


a priori  distribution  f C * | a * ,b • ) defined  by  (2.5.12).  Then,  if  x. 

Y1  1 


is  observed  from  ik,  we  have,  as  in  examples  4 of  Section  2.5, 

ElCe,  - e )2|x]  = d - — i , - 

J ° 3 4fl  + b ' i 1 + b 


’x.  + a' 

where  D.  = -J *-?■ 

J L 1 + b 


- (e, 


'0  2(1 

and  the  Bayes  rule  is  to  select  the  populations  corresponding  to 


{n[! 1 * — ’D[t] )* 


B.  Op  unknown 


Here  we  observe  xi  from  (i  = 0,  l,...,k).  Then  assuming 
(0O,  01,...,e^)  has  a priori  distribution 


g(0o,  01,...,0k)“  n f (0.|a',b') 


i=0  '1 


we  have. 
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2(a ' + xn) 


where 


E[(0.  - 0 ) |x  , x ,...,x  ] = D!  + - | 

J J (1  + b') 

xi  - C*0  ' & 2 

D'  = M _ 1^ 

j 1 1 + b’  J 


The  Bayes  rule  in  this  case  selects  the  populations  corresponding 
to  the  subset  { , . . . , } . 

5.  A problem  in  life  testing 


Let  irQ,  iTj, . . . ,1^  be  k independent  exponential  populations  with 
parameters  0Q,  respectively,  i.e.,  the  density  of  the  i-th 

population  is 

x 

l " e- 

f(x|6.)  = g e 1 , x > 0,  0.  > 0,  i = 0,  l,...,k 

Here,  0^  may  or  may  not  be  known.  As  before,  the  two  cases  will  be 
discussed  separately.  We  will  assume,  in  each  case,  that  0^  are 
independent,  and  each  has  the  same  a priori  density  h(*)-  The 
following  a priori  distributions,  used  by  Bhattacharya  (19671,  will 
be  considered: 


a-1 

(i)  h(0]  = Ca-:1i(a6)a  i , Ir„  cl  C6) , 0 < a < 6,  a>  1 (4.4.5) 

(B3*1  - aa_1)0a  l“*ej 


where  I,  .(•)  is  the  indicator  function  of  [a, 3]. 


_ £ 

(ii)  h(0)  = | e \ 0 > 0,  1 > 0 

A 


(4.4.6) 


i 
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(iii) 


e ® $V+1 

h(0)  = 2 2 , e > 0,  Ji,  V > 0 


(4.4.7) 


pr(v) 


The  prior  density  (4.4.7)  is  the  natural  conjugate  prior  density 
for  0. 


A.  0^  known. 


Here  n items  from  ir^  (i  = l,...,k)  are  put  on  a life  test,  and 


the  experiment  continued  until  the  first  r failures  are  observed. 


Let  be  the  accumulated  life  at  the  termination  of  the  test  on 


defined  by  (2.5.13).  Then  the  probability  density  of  T^  is 
given  by  (2.5.14).  Suppose  the  a priori  distribution  of  0 = 


(0 j , . . . , 0^)  is 


g(0)  = n h(0 . ) 

i=l 


where  h(')  is  one  of  the  three  a priori  denistles  given  above. 
The  posterior  density  g*(0|T)  of  0 = (0j,...,0^),  given 


T = (Tj, . . . ,T^) , in  each  case  is  obtained  from  results  of  Bhattacharya 


(1967) 


When  h(0)  is  given  by  (4.4.5)  we  have 


Ti 


g*(0|T) 


n [- 


i-i  er+a 

i 


e01  I 


That  is,  0^  are  conditionally  independent  and  have  truncated  inverted 


gamma  posterior  distributions  (see  Raiffa  and  Schlaifer  (1961)).  It 
follows  from  Bhattacharya  (1967)  that 


J 


EC6.lT) 


Y ‘(r  + a - 2,^) 
y*(r  + a - 1,T.)  Ti 


Y*(r  + a - 3,T. ) 


E(9i^  ■ rTr  ♦ a 1,T*)  • n 


where 


Y*(n,y)  =v(n,  £)  - y(n,  £) 


a $ n i 

Y(n,x)  = / e s ds  is  the  incomplete  gamma 


function  defined  for  x > 0. 


Hence 


Et(ej  - e0)2|T]  = 0q 


where 


fn  T*(r  + a - 3>Ti)  2 Y*(r  + a * 2*T^ 

j Y*(r  + a - 1,T')  j " * 0 y*(r  + a - I/fT)  j 


For  h(6)  given  by  (4.4.6)  we  have  „ n 

1 . u . 
r 1 I'* 

k -C  e~  + — ) 

g*(e|T)  « n ] , e . > o v i 

i=l  0 1 

k r i -(-? + r1  -i 

The  normalizing  constant  n [/  e de] 

i=l  0 9 


is  evaluated  by  using  an  integral  representation  of  Ky(z),  the 
modified  Bessel  function  of  the  third  kind  of  order  v,  given  in 
F.rdelyi  et  al  (1953)  [see  Bhattacharya  (1967)]. 


, K (2/f7I) 

E(e.|7)  = /xtT  , 

1 " 1 k (2/F7/T) 


2 Kr-3(2'/*i7*) 

E(ef|T)  = \t.-Z-1 — 

iKr_1(2v^7T) 


- ----- 
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and  hence 


where 


E[(0j  - eo)2|I]  = °j2)  + 60 


dC2)  = XTj  Kr_3C2/T7T)  - 260/vr7  Kr_2(2/T7I) 
J Kr-i(2/fi7I) 


In  case  h(0)  is  given  by  (4.4.7),  we  have 

p + T.  p + T..r+v+l 
k 0.  1 e. 

g*(e|T)  = n [-* i * j , 0.  > o 

i=l  (p  + T^I^r  +v  ) 1 


It  follows  from  Bhattacharya  (1967)  that 


P ♦ T. 


E[(e.  - e0)*|T]  = [ i 0n]2  LJL-V.  - 2 + — ° 

J 0 r ♦ V - 2 0 r + v - 1 r + V - 1 


Hence  the  Bayes  rule  in  the  first  two  cases  is  to  select  populations 

associated  with  the  corresponding  t smallest  D-statistics,  and 

. , v + Tt 

in  the  last  case,  is  to  select  the  t populations  for  which  - — - 


r + v - 2 


is  closest  to  0q. 


B.  0O  unknown 


Here  items  from  ifQ  also  are  put  on  the  same  life  test. Let  Tc 

be  the  total  accumulated  life  until  r failures.  In  this  case,  we 

assume  that  the  a priori  distribution  of  e = (e,,,  e 6,)  is 

0 1 k 


~~ 

! 
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8(®n»  ~ n h(9.) 

K i=o  1 


where  h(-)  is  one  of  the  density  functions  considered  above. 
Then,  proceeding  as  in  case  A,  we  can  show  that  the  Bayes  rule 
is  to  select  t populations  associated  with  t smallest  values  of 


y*(r  + a - 3,T  ) 2y*(r  + a - 2,T.)y*(r  + a - 2.TJ 

(i)  T - 


'(r  + a - 1,T.)  . i y*(r  + a - l,T.)y*(r  + a - 1 ,T  ) 


y*(r  + a - 3,T  ) 
y*(r  + a - 1,T0  *0 


when  h(*)  is  given  by  (4.4.5) 


(ii)  AT. 


ltr-3(2/f7I) 

1 Kr.i(2/V») 


- 2X/F7t 


i 0 


^-2(2/j7T)Kr-2(2/VX) 

^-l(2^VX)Kr-l(2/VX) 


1 


+ AT 


W2^ 

Kr-l(2'V7’ 


when  h(-)  is  given  by  (4.4.6) 


It  can  also  be  seen  that,  for  h(-)  given  by  (4.4.7),  the  Bayes 

rule  selects  the  t populations  for  which  (u  + T^)  is  closest  to 

r + v - 2 , . ^ 
r + v - 1 (lJ  V " 


4.5  Selection  of  the  Population  Closest  to  Control -an  Emprical 
Bayes  Approach 

In  this  Section  we  derive  empirical  Bayes  rules  for  selecting 
the  population  closest  to  control. 


r 
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We  are  given  k+1  (k  >_  2)  independent  populations  TrQ,  *2**,,,irk 
with  densities  f(x,eo),  f (x.ep , . . . ,f (x,0k) , respectively,  with 

respect  to  a a-finite  measure  p.  Our  interest  is  in  the  population 

2 

with  the  smallest  value  of  d(0.;0„)  = (0  - 0 ) , 

j 0 j 0 

We  now  formulate  the  problem  in  the  empirical 
Bayes  framework  of  Robbins  (1964). 

We  are  given: 

K+1  K+1 

(i)  a parameter  space  © c R , where  © is  a subset  of  R, 
the  real  line. 

(ii)  an  action  space  U=  {aj,...,a^},  where  a^  is  the  action 

which  selects  7r.  . 

l 

K+ 1 

(iii)  a continuous  loss  function  L:  © x ti  -+  R , with 

K+1 

L(6,aj)  denoting  the  loss  incurred  when  6 fc  © is 
the  state  of  nature  and  action  a^  is  taken.  For  our 
discussion 


" (0j  ' V ' min  (0.  - 0 )2,  l<j<k 
l<i<k  1 u 


(4.5.1) 


(iv)  an  a priori  distribution  G(0n,  0-,...,0.)  = n F(0.) 

k+1  j i 

of  (0q,  0^ , . . . ,0^)  on  © , where  F is  an 

absolutely  continuous  (a.c.)  distribution  function  on  ©. 


(v)  an  observable  random  variable  fc  , j = 0,  l,...,k 
on  which  o-finite  measure  p is  defined;  for  0 fc  ©,  x. 

j J 

has  probability  density  f ( * | 0 ^ ) with  respect  to  p. 
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We  wish  to  find  a decision  function 


tr:^r=  x & + a 

G J-0  > 


such  that 


R(t  G)  = / / L(tr(x),0)h(x|0)du(x)dG(6) 

G rk*l  & G " . " ' 

& 


(4.5.2) 


= min  R(t,G) 
t 


IS. 

where  h(x|0)  = n f(x. |0.),  x = OCq.Xj  , . . . ,xk) , 0 = (0O> 0J , . . . , 0R) 
j=0  J J ~ 


and  du(x)  = dvi(xQ)  x dp(x^)x. . .xdpfx^) 

Since  we  are  interested  in  rules  which  are  Bayes  with  respect 
to  a priori  distribution  G fc  1*,  attention  can  be  restricted  to 
nonrandomized  rules  . 


We  assume  that  a Bayes  rule  t„  for  each  G t l*  exists.  Then, 


Suppose  now  that  the  above  probelm  occurs  repeatedly  and 
independently,  with  the  same  unknown  G throughout.  Thus  we 
have 


<W* 

a sequence  of  pairs  of  random  variables  from  ir.  (j  = 0,  l,...,k) 
each  pair  being  independent  of  all  the  other  paris,  where  x 

jn 

given  = 0^  follows  the  distribution  f C * I ® j ) * A decision  about 
-n  = ^eon’  6ln’ ‘ ' ,6kn^  to  mac*e  a^ter  observing 

^-i  = ^x0i’  xii > • • • >x]^) » i = l,...,n}.  Following  Robbins  (1964) 

we  will  use  some  function  t (x, ;•)  of  x,,...,x  so  that 

n -l  -n  -i  -n 

t (•)  will  be  close  to  t_(-)  in  the  sense  of  Bayes  risk.  To  make 
n u 

this  more  precise,  we  state  a definition  from  Robbins  (1964) : 

Definition  4.6.1:  An  empirical  decision  function  T = {t^}  is 

said  to  be  asymptotically  optimal  (a.o)  relative  to  G if 


from  (4.6.1) 
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Let  Ain(-)  E Ain^i*<--»5n;-)  (i  = n > 1)  be 


sequence  such  that 


AinW  - Va.,x) 


Define 


tnCxl  = a£,  where  l (1  1 * 1 k)  is  such  that 


A = min{0,A~  (x) , . . . ,A,  (x) } 


’Am  ' 0 


(4.5.6) 


It  follows  from  Corollary  1 of  Robbins  (1964)  that  the  sequence 

T = {t  } is  a.o.  relative  to  G. 
n 

We  will  find  the  sequence  Ain  for  some  discrete  distributions. 
Following  Robbins  (1955),  let 


fG(xj)  = / f(x  |0)  dF(0),  j = 0,  l,...,k 


(4.5.7) 


E [9  j I xq>  x » * * * » x|^  1 = 


/../  0P  n [f(x  |0  )dF(0  )] 

0©  J i=o  1 1 l 


/../  n f(x  |0  )dF(0.) 
© © i=0 


/ ©P  f(x.|0.)  dF(0.) 


3 3 3 


fG(xj) 


We  consider  several  examples. 


(4.5.8) 
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( i ) Poisson  populations 


Here  f(x|e)  = 


»x  = o,  i1.,.,e>o 


Let  be  a sample  of  size  n from  n.,  j = 0,  l,...,k. 

Also  let  3 be  the  family  of  all  a.c.  distributions  on  the  positive 
real  line,  and  define  a family  of  distributions  g on  1R^  + ^ by 


g = (G:  Gfe0,  0J, . . . ,6.)  = n F(e  ),  F fc  3}  (4.5.9) 

j=0  J 


From  (4.6.8)  we  have,  for  p = 1,  2, 


Pi  (x-i+P)!  f _ ( x . +p ) 

E(eP  x , x ,...,x.)  = — 2-, — S.  I 

J 0 1 k x7!  ?g  (x  j ) 


number  of  X..,...,X.  = x 

pin(x) il m 

jn  n 


(4.5.10) 


»WW  . W’1 

jn  - x. ! p.  (x.)  » J u>  1 * • . . ,k 

J jn  l 

(4.5.11) 


Since  the  empirical  frequence  function  p^(x)  converges  in  probability 
to  the  (unknown)  marginal  density  f (•),  we  have 


^„p,<;'nSs  E(»?i5) 


. v g e g 


(4.5.12) 
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It  follows  that  the  sequence  defined  by 


V5i v;'  - 


jn 


' - 


jn 


(4.5.13) 


coverges  in  probability  to  A^a^.x)  given  by  (4.6.5). 

(ii)  Geometric  Distribution 

Let  x. x.  be  a sample  size  n from  v.  which  has  a 
J1  jn  j 

geometric  distribution  given  by 


f(x|e  ) = (1  - 0)0*  x = o,  l,  2,...,  o < 6 < l,  j 
J J J j 

Here  d is  the  family  of  a.c.  distributions  on  (0,1)  and  l,  is 
defined  by  (4.6.9) . 

From  (4.6.7)  and  (4.6.8)  we  have 


= 0, 1 , . . . ,k 


f(e?|x) 

In  this  case  we  define 


fG(Vp) 

p = 1>2 


number  of  X. X.  which  equal  x.+p 

h p (x)  = li -22 - 1— 

jn  - number  of  X. X.  which  equal  x. 

Jl  jn  M 3 


and  Ajn(x^ , . . . ,xn;x)  by  (4.6.13). 


(iii)  Binomial  populations 

Here  f(x|e)  = (£)  0x(l-0)r‘x  , x = 0,  l,...,r,  0 < 9 < 

The  family  l*is  same  as  in  example  (ii)  discussed  above. 


1 
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i 


In  this  case,  no  asymptotically  optimal  solution  seems  to 
exist.  Following  Robbins  (1964)  we  propose  a rule  which  will  do 


about  as  well  as  the  (unknown)  Bayes  rule  tfi  for  large  samples. 


It  is  easily  seen  that 


3 - 


(V  fG.r.p  (VP) 

x . 

(r+p  )fr  (X.) 

3 

Xj  +P J G,r  v 

P = 

(xj}  >Say 

0,  1 , — ,r  , ( r > 2 ) 


where 


fG  r(y)  = / U ey(l-e)r‘y  dG(e) 

u,r  o y 


.(p) 


.(p) 


Let  , ‘ ’ ■ ,xjn  * * ' ‘ denote  the  sequence  of  number  of  successes 

in  1st  r-p  out  of  the  r trials  (which  provided  ,...), 

with  X... 

3i  3i 


Define 


(p,  («;p  > 

H)PJ(i)  = 

jn 


( T ) h^r"p^ (x  ) 
VjV  3n  1 V 


where  (r_P)  number  of  . . . ,x?p^  which  equal  x. 

wnere  h.  (x)  = Ji iO 3. 

3n  - n 


Since  h[*"p^(x)  £ f 


3n 


r y ( x ^ ) as  n ->  we  have 
U » 1 "■  P J 


(r:P)  fG 

„(P)  w s —J  ■ . „<p).p 


3n 


(x'V  wv 


i 
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It  follows  that  the  sequence  A.  defined  below  converges  in  probability 


to  AG^aj  *5^ : 


V51'— = [Hjn  " 2Hjn«  V*)] 


' K.  W - Hm(5^»Wl 


(4.5.14) 


For  large  r,  throwing  away  two  observations  does  not  sacrifice 
much  information,  but  it  is  not  at  all  clear  that  the  sequence 
Ajn  defined  by  (4.6.14)  is  the  best  sequence. 
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A t-slippage  problem  can  roughly  be  described  as^follows:  Test  the 
homogeneity  of  the  given  k(>2)  populations  against  ( ) alternative  hypotheses 
that  t of  these  populations  are  different  from  the  rest, that  is  to  say, these 
have  slipped.  In  Chapter  I the  form  of  symmetric  invariant  Bayes  procedures 
has  been  derived  for  the  t-slippage  problem,  when  the  underlying  densities 
are  of  location  or  scale  type,  and  have  a sufficient  maximum  likelihood 
estimate  of  the  parameter  in  case  the  populations  are  homogeneous.  Locally 
best  tests  of  some  t-slippage  problems  in  nonparametric  situations  have  also 
been  derived . 

In  Chapter  II  the  problem  of  selecting  a subset  of  a random  size  which 
contains  the  populations  associated  with  t largest  parameters  is  investigated. 
It  is  shown  that  for  a loss  function  which  is  linear  in  the  parameters  the 
Bayes  rule  for  any  symmetric  a priori  distribution  selects  exactly  t populati- 
ons, provided  assumptions  of  symmetry  and  monotonicity  on  the  density 
function  are  satisfied . For  some  non-linear  loss  functions,  Bayes  rules  have 
been  investigated  for  normal,  exponential,  Poisson  and  binomial  populations, 
when  the  a priori  distributions  are  known. 

Chapter  III  deals  with  the  problem  of  selecting  a subset  containing  the 
normal  population  associated  with  the  largest  mean. Gupta -type  rules  based  on 
sample  medians  have  been  investigated.  Numerical  comparisons  are  carried  out 
in  the  case  of  equally  spaced  normal  means  which  show  that  the  selection  rule 
based  on  the  sample  mean  is  'better'  than  the  proposed  rule  based  on  sample 
medians.  However,  if  the  normal  populations  are  contaminated, the  proposed  rule 
based  on  sample  medians  is  superior  to  the  rule  based  on  sample  means  in  terms 
of  the  asymptotic  relative  efficiency. 

Chapter  IV  consists  of  investigations  of  procedures  for  the  following 
problems:  (i)  To  select  a subset  of  random  size  which  contains  all  populations 
'close'  to  a given  control  population,  and  (ii)  to  select  the  t populations 
closest  to  control.  Bayes  and  Gupta  type-rules  have  been  investigated. 
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